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Description 

Technical Field 

[0001] The present invention relates to a molecule as- 
signing a genotype to a phenotype. More specifically, it 
relates to a molecule assigning a genotype to a pheno- 
type, comprising a nucleic acid portion having a nucle- 
otide sequence reflecting the genotype and a protein 
portion comprising a protein involved in exhibition of the 
phenotype. The molecule assigning the genotype to the 
phenotype of the present invention is a highly useful 
substance that can be utilized in evolutionary molecular 
engineering such as in the modification of enzymes, an- 
tibodies, ribozymes and other such functional biopoly- 
mers and creation of biopolymers having functions not 
found in living organisms. 

[0002] Through advances in biochemistry, molecular 
biology and biophysics, it has been learned that living 
organisms are molecular machines which function and 
propagate by interactions among molecules. Among the 
characteristics of earth's living organisms, the funda- 
mentals are their preservation of genetic information in 
DNA nucleotide sequences and their ability to translate 
this information into functional proteins through the me- 
dium of mRNA. Owing to progress in genetic engineer- 
ing, biopolymers with given sequences, like nucleotides 
and peptides, can now be easily synthesized. Protein 
engineering and RNA engineering, today a focus of at- 
tention, owe their existence to genetic engineering. The 
aims of protein engineering and RNA engineering are 
to solve the puzzle of the three-dimensional structures 
required for proteins and RNAf ulfilling specific functions 
and to enable humans to freely design proteins and RNA 
possessing desired functions. Because of the diversity 
and complexity of these structures and the difficulty of 
a theoretical approach to their three-dimensional struc- 
tures, however, current protein engineering and RNA 
engineering are still at the stage of modifying some of 
residues at active sites and observing changes in the 
structure and functions. Human knowledge has thus not 
yet reached the stage of designing proteins and RNA. 
[0003] Understanding the functions of biopolymers in 
their relationship to the elemental processes of higher 
life phenomena will require elucidation of the correlation 
between protein molecular structure and function. The 
line of thought we take in the following is not only to 
make the best of "human knowledge" but also to take 
advantage of the "wisdom of nature." This is because 
we concluded that we would have to acquire the ability 
to put both to work in order to overcome the current dif- 
ficulties of protein engineering and move forward with 
the design and production of functional biopolymers. 
When the classical methods are diverted to the design 
of proteins with newf unctions and activities, the difficulty 
of protein design by site-specific mutations can some- 
times be avoided. This can be called "taking advantage 
of the wisdom of nature." 



[0004] Although the drawback of this method is the 
difficulty of screening to identify mutants with new func- 
tions and activities, this difficulty is overcome by the 
RNA catalysts that have recently come into the spotlight. 

5 Attempts have been made to select an RNA with specific 
characteristics from among RNAs synthesized to have 
an extremely large number of random sequences (about 
10 13 types) (Ellington, A. D. & Szostak, J. W. (1990) Na- 
ture, 346, 818-822). 

10 [0005] This is an example of evolutionary molecular 
engineering. As typified by this example, the primary 
goal in the evolutionary molecular engineering of pro- 
teins is to find out optimum sequences by searching an 
expansive sequence space of a scale unimaginable in 

15 conventional protein engineering. By "making the best 
of human knowledge" to devise a screening system for 
this, it will be possible to discover numerous quasi-op- 
timum sequences around the optimum sequences and 
thus to construct an experimental system for studying 

20 "sequence vs function." 

[0006] The remarkable functions of living bodies were 
acquired through the process of evolution. Therefore, if 
evolution can be replicated, it should be possible to 
modify enzymes, antibodies, ribozymes and other func- 

25 tional biopolymers and, further, to create biopolymers 
with functions not found in living organisms in the labo- 
ratory. Needless to say, research on protein modification 
and creation is an object of utmost importance to various 
aspects of biotechnology such as utilization of enzymes 

30 as industrial catalysts, biochips. biosensors and sugar- 
chain engineering. 

[0007] Given the fact that molecular design utilizing 
structural theory is, as symbolized by the continuing 
high regard for "screening," still in an unperfected state, 

35 the evolutionary technique has a practical value for uti- 
lization in selecting useful proteins as a more efficient 
strategy. Building a "time machine" capable of more ef- 
ficiently producing evolution in a laboratory, if such were 
possible, would not only enable modification of en- 

40 zymes, antibodies (vaccines, monoclonal antibodies 
etc.) and other existing proteins but also open the way 
to the creation of enzymes for decomposing environ- 
mental contaminants, purifiers and others and new pro- 
teins not present in the biological world. If an experimen- 

45 tal system for protein evolution can be established, 
therefore, it can be expected to be aggressively utiliza- 
ble for application in a wide range of fields including 
power saving and energy preservation in industrial proc- 
esses, energy production and environmental preserva- 

50 tion. The assigning molecule of the present invention is 
a highly useful substance in protein modification and 
other aspects of evolutionary molecular engineering. 

Background Art 

55 

[0008] Evolutionary molecular engineering is a field of 
study that attempts to conduct molecular design of func- 
tional polymers by utilizing high-speed molecular evolu- 
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tion in the laboratory, i.e. , by laboratory investigation and 
optimization of the adaptive locomotion of biopolymers 
in sequence space. It is a completely new molecular bi- 
otechnology that first produced substantial results in 
1990 (Yuzuru Husimi (1991) Kagaku, 61 , 333-340; Yu- 
zuru Husimi (1992) Koza Shinka, Vol. 6, University of 
Tokyo Publishing Society). 

[0009] Life is a product of molecular evolution and nat- 
ural selection. The evolution of molecules is a universal 
life phenomenon but its mechanism is not something 
that can be elucidated by studies that track the history 
of past evolution. Rather, the approach of constructing 
and studying the behavior of simple molecules and life 
systems that evolve in the laboratory better provides 
fundamental knowledge regarding molecular evolution 
and enables establishment of a verifiable theory appli- 
cable in molecular engineering. 

[001 0] It is known that a polymer system will evolve if 
it satisfies the following five conditions: (1 ) an open sys- 
tem far out of equilibrium, (2) a self-replicative system, 
(3) a mutation system, (4) a system with genotype and 
phenotype assignment strategy, and (5) a system with 
appropriate adaptation topography in sequence space. 
(1) and (2) are conditions for occurrence of natural se- 
lection and (5) is determined beforehand by the physi- 
cochemical properties of the biopolymer. The genotype 
and phenotype assignment of (4) is a prerequisite for 
evolution by natural selection. 

[0011] The following three strategies are adopted in 
both the natural world and evolutionary molecular engi- 
neering: (a) ribozyme-type in which the genotype and 
the phenotype are carried on the same molecule, (b) vi- 
rus-type in which the genotype and the phenotype form 
a complex, and (c) a cell-type in which the genotype and 
the phenotype are contained in a single compartment 
(Figure 1). 

[001 2] As the ribozyme-type (a) in which the genotype 
and the phenotype are carried on the same molecule is 
a simple system, success with RNA catalysts (ri- 
bozymes) has already been reported (Hiroshi Yana- 
gawa (1993) New Age of RNA, pp. 55-77, Yodosha). 
[0013] Conceivable problem points of the eel I -type (c) 
are (1) the averaging effect, (2) the eccentricity effect 
and (3) the random replication effect. The averaging ef- 
fect arises because the assignment of the genotype to 
the phenotype statistically averages out and becomes 
ambiguous when the number of copies of the cell ge- 
nome is large. Since an evolved genome is only one 
among the number of copies in a cell (n). performance 
enhancement averages out and a struggle for existence 
in the cell population begins at selection coefficient (s)/ 
n. A smaller copy number (n) is therefore advantageous 
for the cell-type. Due to the presence of the eccentricity 
effect, however, when the number of segments is large, 
n must be very large to prevent the eccentricity effect. 
The apparent selection coefficient in the struggle for ex- 
istence in the cell population can therefore be expected 
to be very much smaller than in the case of the virus- 



type. Since thetime requiredforselection is proportional 
to the reciprocal of the selection coefficient, the rate of 
evolution is much slower than that of the virus-type. Fur- 
ther, the random replication effect (3) is fatal to the cell- 

5 type. This is because the random replication of seg- 
mented essential genes by this effect makes replication 
of all essential genes prior to cell division extremely dif- 
ficult. This means that even if an essential gene with an 
advantageous mutation should occur, the probability of 

10 its being replicated and passed on to a daughter cell is 
extremely low. 

[001 4] Uniting of the genotype and the phenotype as 
in the virus-type (b) is necessary for efficient evolution. 
[0015] Various techniques have already been pro- 
fs posed and are in the process of development for evolu- 
tionary molecular engineering of the virus-type (b) form- 
ing a complex of the genotype and the phenotype, in- 
cluding phage display (Smith, G. P. (1985) Science 228, 
1315-1317; Scott, J. K. & Smith, G. P. (1990) Science 
20 249, 386-390), polysome display (Mattheakis, L. C. et 
al. (1994) Proc. Natl. Acad. Sci. USA 91, 9022-9026), 
encoded combinatorial library (Brenner, S. & Lerner, R. 
A. (1992) Proc. Natl. Acad. Sci. USA 89, 5381-5383), 
and cellstat (Husimi, Y. et al. (1982) Rev. Sci. Instrum. 
25 53,517-522). 

[001 6] Despite the importance of the magnitude of the 
searchable sequence space in evolutionary molecular 
engineering, however, a method for globally searching 
a sequence space comparable to that of the ribozyme 
30 type has not yet been established for the virus-type. 
[0017] The reason for this is that viruses currently 
used in the method such as phage displays are para- 
sites of existing cells and are therefore unavoidably sub- 
ject to restraints imposed by the host cells, among which 
35 can be listed: (1 ) that only a limited sequence space can 
be searched owing to restriction by the cells, (2) mem- 
brane permeability, (3) bias due to host, and (4) limita- 
tion on library owing to host population. 
[0018] The polysome display method (Mattheakis, L. 
40 C. & Dower, W. J. (1995) W095/11922) joins a nucleic 
acid and a protein via a ribosome by non-covalent bond- 
ing. It is therefore suitable when the chain length at the 
peptide position is short but encounters handling prob- 
lems when the chain length is long as a protein. Since 
45 the huge ribosome remains interposed, the conditions 
at the time of the selection operation (e.g., adsorption, 
elution or the like) are subjected to severe restriction. 
The encoded combinatorial library (Janda, F. H. & Lern- 
er, R. A. (1 996) W096/22391) assigns a chemically syn- 
50 thesized peptide to a nucleic acid tag via beads. Since 
the yield of chemical synthesis of proteins with around 
1 00 residues is extremely poor using currently available 
technologies, however, this technique can be used with 
short chain-length peptides but not with long chain- 
55 length proteins. 

[001 9] One conceivable method of overcoming these 
problems is use of a cell-free translation system. A virus- 
type strategy molecule that simply binds the genotype 
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and the phenotype in the cell-free systems has a 
number of advantages including the following: (1) that 
a huge mutant population approaching that of the ri- 
bozyme-type can be synthesized, (2) creation of various 
proteins without dependence on a host, (3) no problem 
regarding membrane permeability, and (4) that the 21 st 
code can be used to introduce a non-native amino acid. 

Description of the Invention 

[0020] An object of the present invention is to provide 
a molecule comprising a virus-type operation replicon 
which has the advantages of the aforementioned virus- 
type strategy molecule, exhibits a higher efficiency than 
phages, and suffers fewer limitations concerning envi- 
ronmental condition setting, namely, a molecule which 
should be called "in vitro virus", wherein a nucleic acid 
and a protein are bound by a chemical bond, that is, a 
molecule in which a genotype is assigned to a pheno- 
type. More specifically, the present invention has been 
accomplished in order to provide a molecule exhibiting 
one-on-one relationship between information and func- 
tion, which can be utilized for creation of functional pro- 
teins and peptides, by performing genotype (nucleic ac- 
id) assignment to phenotype (protein) using a cell-free 
protein synthesis system, and binding the 3'-terminal 
end of a gene to the C-terminal end of a protein with a 
covalent bond on ribosome. Further, it is also an object 
of the present invention to obtain target functional pro- 
teins or peptides through investigation of vast sequence 
space, which is performed by repetition of selection of 
molecules that assign genotypes to phenotypes formed 
as described above (also referred to as "in vitro virus" 
hereinafter) by the in vitro selection method, and ampli- 
fication of gene portions of the selected in vitro viruses 
by the reverse transcription PCR, and further amplifica- 
tion while introducing mutations. 
[0021] The present inventors earnestly conducted in- 
vestigations to achieve the aforementioned objects, and 
as a result, they found that two kinds of molecules that 
assign a genotype to a phenotype, comprising a nucleic 
acid and a protein which were chemically bound can be 
constructed on a ribosome in a cell-free protein synthe- 
sis system. They further found that a protein evolution 
simulation system can be constructed wherein the as- 
signing molecules (in vitro viruses) were selected by the 
in vitro selection method, gene portions of the selected 
in vitro viruses were amplified by reverse transcription 
PCR, and the genes were further amplified while intro- 
ducing mutations. The present invention has been ac- 
complished based on these findings. 
[0022] Thus the present invention provides a mole- 
cule assigning a genotype to a phenotype, which com- 
prises a nucleic acid portion having a nucleotide se- 
quence reflecting the genotype, and a protein portion 
comprising a protein involved in exhibition of the pheno- 
type, the nucleic acid portion and the protein portion be- 
ing directly bound by a chemical bond. 



[0023] According to preferred embodiments of the 
present invention, there are provided the aforemen- 
tioned assigning molecule wherein a 3 -terminal end of 
the nucleic acid portion and a C-terminal end of the pro- 
5 tein portion are bound by a covalent bond, and the afore- 
mentioned assigning molecule wherein a 3-terminal 
end of the nucleic acid portion covalently bound to a C- 
terminal end of the protein portion is puromycin. 
[0024] According to another preferred embodiment of 
10 the present invention, there is also provided the afore- 
mentioned assigning molecule wherein the nucleic acid 
portion comprises a gene encoding a protein, and the 
protein portion is a translation product of the gene of the 
nucleic acid portion. The nucleic acid portion preferably 
15 comprises a gene composed of RNA, and a suppressor 
tRNA bonded to the gene through a spacer. The sup- 
pressor tRNA preferably comprises an anticodon corre- 
sponding to a termination codon of the gene. Alterna- 
tively, the nucleic acid portion may comprise a gene 
20 composed of RNA, and a spacer portion composed of 
DNA and RNA, or DNA and polyethylene glycol. The nu- 
cleic acid portion may comprise a gene composed of 
DNA, and a spacer portion composed of DNA and RNA. 
[0025] As further aspects of the present invention, 
25 there are provided a method for constructing a molecule 
assigning a genotype to a phenotype, which comprises 
(a) boding a DNA comprising a sequence corresponding 
to a suppressor tRNA, to a 3'-terminal end of a DNA con- 
taining a gene through a spacer, (b) transcribing the ob- 
30 tained DNA bonded product into RNA, (c) bonding, to a 
3'-terminal end of the obtained RNA, a nucleoside or a 
substance having a chemical structure analogous to 
that of a nucleoside, which can be covalently bonded to 
an amino acid or a substance having a chemical struc- 
35 ture analogous to that of an amino acid, and (d) perform- 
ing protein synthesis in a cell-free protein synthesis sys- 
tem using the obtained bonded product as mRNA to 
bond a nucleic acid portion containing the gene to a 
translation product of the gene; and a method for con- 
40 structing a molecule assigning a genotype to a pheno- 
type, which comprises (a) preparing a DNA containing 
a gene which has no termination codon, (b) transcribing 
the prepared DNA into RNA, (c) bonding a chimeric 
spacer composed of DNA and RNA to a 3'-terminal end 
45 of the obtained RNA, (d) bonding, to a 3'-terminal end 
of the obtained bonded product, a nucleoside or a sub- 
stance having a chemical structure analogous to that of 
a nucleoside, which can be covalently bondedto an ami- 
no acid or a substance having a chemical structure anal- 
50 ogous to that of an amino acid, and (e) performing pro- 
tein synthesis in a cell-free protein synthesis system us- 
ing the obtained bonded product as mRNA to bond a 
nucleic acid portion containing the gene to a translation 
product of the gene. 
55 [0026] According to a preferred embodiment of the 
present invention, there is provided the aforementioned 
construction method wherein the nucleoside orthe sub- 
stance having the chemical structure analogous to that 
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of the nucleoside is puromycin. 
[0027] As another aspect of the present invention, 
there is provided a method for constructing a molecule 
assigning a genotype to a phenotype, which comprises 
(a) preparing a DNA containing a gene which has no 
termination codon, (b) transcribing the prepared DNA 
into RNA, (c) bonding a chimeric spacer composed of 
DNA and polyethylene glycol to a 3'-terminal end of the 
obtained RNA ; (d) bonding, to a 3'-terminal end of the 
obtained bonded product, a nucleoside or a substance 
having a chemical structure analogous to that of a nu- 
cleoside, which can be covalently bound to an amino 
acid or a substance having a chemical structure analo- 
gous to that of an amino acid, and (e) performing protein 
synthesis in a cell-free protein synthesis system using 
the obtained bonded product as mRN Ato bond a nucleic 
acid portion containing the gene to a translation product 
of the gene. 

[0028] As another aspect of the present invention, 
there is provided a method for constructing a molecule 
assigning a genotype to a phenotype, which comprises 
(a) preparing a DNA containing a gene which has no 
termination codon, (b) transcribing the prepared DNA 
into RNA, (c) bonding a spacer composed of double- 
stranded DNA to a 3'-terminal end of the obtained RNA, 
(d) bonding, to a 3'-terminal end of the obtained bonded 
product, a nucleoside or a substance having a chemical 
structure analogous to that of a nucleoside, which can 
be covalently bound to an amino acid or a substance 
having a chemical structure analogous to that of an ami- 
no acid, and (e) performing protein synthesis in a cell- 
free protein synthesis system using the obtained bond- 
ed product as m RNA to bond a nucleic acid portion con- 
taining the gene to a translation product of the gene. 
[0029] As a further aspect of the present invention, 
there is provided a method for constructing a molecule 
assigning a genotype to a phenotype, which comprises 
(a) preparing a DNA containing a gene which has no a 
termination codon, and a nucleotide sequence of a 
spacer, (b) transcribing the prepared DNA into RNA, (c) 
bonding, to a3'-terminal end of the obtained RNA, a nu- 
cleoside or a substance having a chemical structure 
analogous to that of a nucleoside, which can be cova- 
lently bonded to an amino acid or a substance having a 
chemical structure analogous to that of an amino acid, 
(d) adding a short chain PNA or DNA to a 3'-terminal 
end side portion of the gene in the obtained RNA bonded 
product to form a double-stranded chain, and (e) per- 
forming protein synthesis in a cell-free protein synthesis 
system using the obtained bonded product as mRNAto 
bond a nucleic acid portion containing the gene to a 
translation product of the gene. 

[0030] As a still further aspect of the present inven- 
tion, there is provided a method for protein evolution 
simulation, which comprises a construction step for con- 
structing assigning molecules from a DNA containing a 
gene by any one of the construction methods mentioned 
above, a selection step for selecting the assigning mol- 



ecules obtained in the construction step, a mutation in- 
troduction step for introducing a mutation into a gene 
portion of an assigning molecule selected in the selec- 
tion step, and an amplification step for amplifying the 
5 gene portion obtained in the mutation introduction step. 
In the method for evolution simulation, the construction 
step, the selection step, the mutation introduction step 
and the amplification step are preferably performed re- 
peatedly by providing the DNA obtained in the amplifi- 
10 cation step to the construction step. Further, there is pro- 
vided an apparatus for performing the aforementioned 
method for evolution simulation, which comprises a 
means for constructing assigning molecules, said 
means comprising a first bonding means for bonding a 
15 DNA comprising a sequence corresponding to a sup- 
pressor tRNA to a 3'-terminal end of a DNA containing 
a gene through a spacer, a transcription means for tran- 
scribing the DNA bonded product obtained by the first 
bonding means into RNA, a second bonding means for 
20 bonding, to a 3'-terminal end of the RNA obtained by a 
transcription means, a nucleoside or a substance hav- 
ing a chemical structure analogous to that of a nucleo- 
side, which can be covalently bound to an amino acid 
or a substance having a chemical structure analogous 
25 to that of an amino acid, and a third bonding means for 
performing protein synthesis in a cell-free protein syn- 
thesis system using the bonded product obtained by the 
second bonding means as mRNAto bond a nucleic acid 
portion containing the gene to a translation product of 
30 the gene, or a means for constructing assigning mole- 
cules, said means comprising a transcription means for 
transcribing a DNA containing a gene into RNA, a first 
bonding means for bonding a chimeric spacer com- 
posed of DNA and RNA, a chimeric spacer composed 
35 of DNA and polyethylene glycol, a double-stranded 
spacer composed of DNA and DNA, or a double-strand- 
ed spacer composed of RNA and a short chain peptide 
nucleic acid (PNA) or DNA to a 3'-terminal end of the 
RNA obtained by the transcription means, a second 
40 bonding means for bonding, to a 3'-terminal end of the 
RNA-spacer bonded obtained by the first bonding 
means, a nucleoside or a substance having a chemical 
structure analogous to that of a nucleoside, which can 
be covalently bound to an amino acid or a substance 
45 having a chemical structure analogous to that of an ami- 
no acid, and a third bonding means for performing pro- 
tein synthesis in a cell-free protein synthesis system us- 
ing the bonded product obtained by the second bonding 
means as mRNAto bond a nucleic acid portion contain- 
50 ing the gene to a translation product of the gene; a se- 
lection means for selecting the constructed assigning 
molecules; a mutation introduction means for introduc- 
ing a mutation into a gene portion of an assigning mol- 
ecule selected; and an amplification means for amplify- 
55 jng the gene portion to which the mutation is introduced. 
[0031] As a still further aspect of the present inven- 
tion, there is provided a method for assaying protein/ 
protein or protein/nucleic acid intermolecular action, 
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which comprises a construction step for constructing as- 
signing molecules by any one of the aforementioned 
construction methods, and an assay step for examining 
intermolecular action of the assigning molecules ob- 
tained in the construction step with another protein or 5 
nucleic acid. 

Brief Description of the Drawings 

[0032] 10 

Figure 1 shows strategies for genotype (nucleic ac- 
id portion) assignment to phenotype (protein por- 
tion). 

Figure 2 shows a method for construction of the 15 
molecule assigning the genotype to the phenotype 
of the present invention wherein a nucleic acid por- 
tion and a protein portion are bonded in a site-di- 
rected manner. 

Figure 3 shows chemically-modified portions of the 20 
3'-terminal ends of nucleic acid portions, which are 
a point of the construction of the molecule assigning 
the genotype to the phenotype (in vitro virus). 
Figure 4 shows a method for construction of the 
molecule assigning the genotype to the phenotype 25 
of the present invention wherein a nucleic acid por- 
tion and a protein portion are bonded in a non-site- 
directed manner. 

Figure 5 is a photograph of electrophoresis image 
that shows spacer optimization in the site-directed 30 
method. It shows the results of 4% polyacrylamide 
gel electrophoresis (in the presence of 8 M urea) of 
a DNA obtained through a process comprising 
translation of each RNA genome having a spacer 
in a length corresponding to each of the prepared 35 
fractions a, b. and c in the presence of a biotinylated 
lysyl tRNA in an E. coli cell-free translation system, 
specific absorption on streptavidin -coated magnet- 
ic beads, reverse transcription, and amplification by 
PCR (staining was silver staining). Lane 1 isforthe 40 
spacer length of fraction a (255-306 residues), Lane 
2 is for the spacer length of fraction 2 (1 02-238 res- 
idues), and Lane 3 is for the spacer length of fraction 
c (0-85 residues). 

Figure 6 is a photograph of electrophoresis image 45 
showing bonding of a nucleic acid portion and a pro- 
tein portion in a site-directed method. The results 
were obtained by 18% polyacrylamide gel electro- 
phoresis (in the presence of 8 M urea and SDS): 
Lane 1 for a translation product of mRNA encoding 50 
the 4 repeats region of atau protein, which was ob- 
tained in an E. co//cell-free translation system while 
labeled with [ 35 S]-methionine, and Lane 2 for a 
translation product of the mRNA whose 3'-terminal 
end was bonded to sup tRNA having puromycin, 55 
and whose 5'-terminal end was labeled with [ 32 P], 
which was obtained in an E. co// cell-free translation 
system. 



Figure 7 is a photograph of electrophoresis image 
showing bonding of nucleic acid portion and protein 
portion in the non-site-directed method. The results 
were obtained by 18% polyacrylamide gel electro- 
phoresis (in the presence of SDS): Lane 1 for a 
translation product of mRNA encoding the 4 repeats 
region of a tau protein, which was obtained in an E. 
coli cell-free translation system, while labeled with 
[ 35 S]-methionine, and Lane 2 for a translation prod- 
uct of the mRNA whose 3'-terminal end was bonded 
through a spacer to puromycin labeled with [ 32 P] at 
the 5' end, which was obtained in an E. coli cell-free 
translation system, and Lane 3 for the translation 
product of Lane 2 digested with ribonuclease T2. 
Figure 8 shows an example of the method for con- 
structing the molecule assigning the genotype to 
the phenotype (in vitro virus) according to the 
present invention. 

Figure 9 is a photograph of electrophoresis image 
showing bonding of rCpPurto the C-terminal of the 
N-terminal half (1-165) of human tau protein. Three 
kinds of genomes, i.e., one having a stop codon but 
not a DNA spacer (the first lane from the left), one 
having neither of a stop codon and a DNA spacer 
(the second lane from the left), and one not having 
a stop codon but having a DNA spacer (the third 
lane from the left) each atthe3'-terminal end of mR- 
NA encoding the N-terminal half (1 -1 65) of human 
tau protein, were constructed, and translated in a 
cell-free translation system utilizing rabbit reticulo- 
cyte lysate in the presence of rCpPur labeled 
with 32 P at 30°C for 20 minutes. The translation 
products were analyzed by 11.25% SDS-PAGE. 
The lane at the right end shows the result for a prod- 
uct obtained by translation of the mRNA encoding 
the N-terminal half of human tau protein (1-165) in 
the presence of [ 35 S]-methionine under the same 
condition mentioned above. 

Figure 1 0 is a photograph of electrophoresis image 
showing generation of in vitro viruses in a cell-free 
translation system. (A) shows a time course of gen- 
eration of in vitro viruses. A genome composed of 
the mRNA encoding the N-terminal half of human 
tau protein (1-165), a DNA spacer (105 mer), a pep- 
tide acceptor, and rCpPur was translated in a cell- 
free translation system utilizing rabbit reticulocyte 
lysate and containing [ 35 S]-methionine, and the 
translation product was examined in a time course 
(at 5 minutes, 1 0 minutes, 20 minutes, and 40 min- 
utes) at 30°C. The translation products were ana- 
lyzed by 1 1 .25% SDS-PAGE. The first lane from the 
left shows the result obtained by using the RNA en- 
coding the N-terminal half of human tau protein 
(1-165) as mRNA, and examining incorporation of 
[ 35 S]-methionine into the protein under the same 
condition as mentioned above. The lane at the right 
end shows the result of in vitro virus genome la- 
beled with 32 P. (B) shows influence of concentration 
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of in vitro virus genome for the generation of in vitro 
viruses. Lane 1 shows the results for a genome la- 
beled with [ 32 P]-rCpPur at the 3'-terminal end, Lane 
2 for a genome (1 .2 \ig) to which rCpPur was at- 
tached at its 3'-terminal end, Lane 3 for a genome 
(0.33 jig) to which rCpPur was attached at its 3'- 
terminal end, and Lane 4 for a genome (0.64 t ug) to 
which rCpPur was attached at its 3'-terminal end. 
As for Lanes 2-4, the genomes were translated in a 
cell-free translation system utilizing rabbit reticulo- 
cyte lysate and containing [ 35 S]-methionine at 30°C 
for 20 minutes. The translation products were ana- 
lyzed by 11 .25% SDS-PAGE. 
Figure 11 is a photograph of electrophoresis image 
showing generation of in vitro viruses in a cell-free 
translation system. An in vitro virus genome com- 
posed of the mRNA encoding the N-terminal half 
(1-165) of human tau protein, a DNA spacer (105 
mer), a peptide acceptor, and [ 32 P]-rCpPur was 
translated by utilizing rabbit reticulocyte lysate at 
30°C for 20 minutes. The translation products were 
analyzed by 11.25% SDS-PAGE. The bonding of 
the genome and the protein could be confirmed by 
digestion with mung bean nuclease. When the 
translation product (Lane 3) was digested with 
mung bean nuclease, bands appeared (Lane 4) at 
the locations corresponding to monomer and dimer 
(Lane 1 ) of the N-terminal half of human tau protein 
(1 -1 65). Lane 2 shows the result for an in vitro virus 
genome labeled with 32 P. 

Figure 12 shows process steps of a protein evolu- 
tion simulation method utilizing in vitro viruses. 

Best Mode for Carrying out the Invention 

[0033] In this specification, some technical terms are 
used, and those technical terms have the following 
meanings when herein used. The term "nucleic acid por- 
tion" means a bonded product of a nucleoside or a sub- 
stance having a chemical structure analogous to a nu- 
cleoside, for example, RNA, DNA, PNA (peptide nucleic 
acid; polymers comprising nucleic acids linked via ami- 
no acid analogues) and the like, and "protein portion" 
means a bonded product of an amino acid or a sub- 
stance having a chemical structure analogous to an ami- 
no acid such as naturally-occurring amino acids and 
non-naturally-occurring amino acids. The term "sup- 
pressor tRNA (sup tRNA)" means a tRNA which can 
suppress mutation by structural change, for example, 
reading a termination codon on mRNA as a codon cor- 
responding to a certain amino acid. The expression of 
"having a nucleotide sequence reflecting genotype" 
means to contain a gene or a part thereof relating to a 
genotype. The expression of "containing a protein in- 
volved in exhibition of phenotype" means to contain, for 
example, a protein whose expression itself is a charac- 
teristic of phenotype, a protein involved in exhibition of 
a characteristic of phenotype by its function as an en- 



zyme or the like. 

[0034] The spacer located at the 3'-terminal end side 
of the nucleic acid portion may be any spacer provided 
that it is a polymer substance preferably having a length 

5 of not less than 100A, more preferably about 100 to 
1000A. Specifically, single-stranded chains of RNA or 
DNA, double-stranded chains of DNA and DNA. double- 
stranded chains of RNA and short chain PNA or DNA 
(e.g., about 15 to 25 nucleotides), and polymer materials 

10 such as polysaccharides, which are naturally-occurring 
orsynthetic, synthetic organic polymer substances such 
as polyethylene glycols, preferably polyethylene glycols 
having a molecular weight of about 3,000 to 30,000 and 
the like can be mentioned. 

15 [0035] The nucleic acid portion and the protein portion 
of the assigning molecule of the present invention are 
linked through a chemical bond such as a covalent 
bond. In particular, preferred are those formed by bond- 
ing a nucleoside or a substance having a chemical struc- 

20 ture analogous to a nucleoside, or a linked product 
thereof present at the 3'-terminal end of the nucleic acid 
portion to an amino acid or a substance having a chem- 
ical structure analogous to an amino acid present at the 
C-terminal end of the protein portion via a chemical 

25 bond, for example, a covalent bond. 

[0036] For the bonding between the nucleic acid por- 
tion and the protein portion, for example, puromycin, 
3-N-aminoacylpuromycin aminonucleoside (PANS- 
amino acid), which have an amide bond as the chemical 

30 bond at the 3'-terminal end of the nucleic acid portion, 
e.g., PANS-Gly wherein the amino acid portion is gly- 
cine, PANS-Val wherein the amino acid portion is valine, 
PANS-Ala wherein the amino acid portion is alanine, 
and further PANS-(any of the other amino acids) where- 

35 in the amino acid portion is an of the other amino acids, 
can be utilized. 3'-A/-Aminoacyladenosine aminonucle- 
oside (AANS-amino acid), which comprises as the 
chemical bond an amide bond formed by dehydration 
condensation of the amino group of 3 -aminoadenosine 

40 and the carboxyl group of an amino acid, for example, 
AANS-Gly wherein the amino acid portion is glycine, 
AANS-Val wherein the amino acid portion is valine, 
AANS-Ala wherein the amino acid portion is alanine, 
and further AANS-(any of the other amino acids) where- 

45 in the amino acid portion is any of the other amino acids, 
can also be utilized. Those composed of a nucleoside 
or a nucleoside bound to an amino acid via an ester 
bond may also be used. Further, any other materials 
having a binding mode capable of binding a nucleoside 

50 or a substance having a chemical structure analogous 
to a nucleoside and an amino acid or a substance having 
a chemical structure analogous to an amino acid can 
also be utilized. 

[0037] The molecule assigning the genotype to the 
55 phenotype of the present invention can be constructed 
by, for example, (1 ) a method where the binding of the 
nucleic acid portion and the protein portion is formed in 
a site-directed manner, or (2) a method where the bond- 
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ing of the nucleic acid portion and the protein portion is 
formed in a non-site-directed manner, which will be ex- 
plained hereinafter. 

[0038] First, (1) the method where the bonding of the 
nucleic acid portion and the protein portion is formed in 
a site-directed manner will be explained. 
[0039] In this method, a molecule assigning a geno- 
type to a phenotype can be constructed by (a) bonding 
a DNA comprising a sequence corresponding to sup 
tRNA, to the 3'-terminal end of a DNA containing a gene 
through a spacer (b) transcribing the obtained DNA 
bonded product into RNA, (c) bonding. tothe3'-terminal 
end of the obtained RNA, a nucleoside or a substance 
having a chemical structure analogous to that of a nu- 
cleoside, which can be covalently bound to an amino 
acid or a substance having a chemical structure analo- 
gous to that of an amino acid, e.g., puromycin, (d) per- 
forming protein synthesis in a cell-free protein synthesis 
system, e.g., an E. coli cell-free protein synthesis sys- 
tem, using the obtained bonded product as mRNA, and 
thus (e) affording a molecule assigning a genotype to a 
phenotype comprising a gene RNA (genotype) and a 
protein (phenotype) which is a translation product of the 
gene, which are chemically bound through a nucleoside 
or a substance having a chemical structure analogous 
to that of a nucleoside, e.g., puromycin. 
[0040] That is, according to this method of the present 
invention, when a termination codon comes into the A 
site of ribosome during the protein synthesis, a sup 
tRNA is correspondingly incorporated, and a nucleoside 
or a substance having a chemical structure analogous 
to that of a nucleoside, e.g., puromycin, present at the 
3'-terminal end of the sup tRNA is bound to a protein by 
the action of peptidyl transferase (Figure 2). Therefore, 
this method is site-directed as for the formation of the 
bonding between the nucleic acid portion and the pro- 
tein, which depends on the genetic code. 
[0041] It has been known that puromycin (Figure 3) 
inhibits the protein synthesis in bacteria (Nathans, D. 
(1 964) Proc. Natl. Acad. Sci. USA, 51 , 585-592; Takeda, 
Y. et al. (1960) J. Biochem. 48, 169-177) and animal 
cells (Ferguson, J. J. (1 962) Biochim. Biophys. Acta 57, 
616-617; Nemeth, A. M. & de la Haba, G. L. (1962) J. 
Biol. Chem. 237, 1190-1193). Puromycin, whose struc- 
ture resembles the structure of aminoacyl tRNA, reacts 
with peptidyl tRNA bound to the P site of ribosome, and 
it is released from ribosome as peptidyl puromycin, and 
thus interrupts the protein synthesis (Harris, R.J. (1971) 
Biochim. Biophys. Acta 240, 244-262). 
[0042] It is not practical to purify native sup tRNA and 
bond it to mRNA, because of the problems concerning 
the purification of sup tRNA and the easily hydrolyzable 
ester bond at the 3'-terminal end of tRNA. Through in- 
vestigations of tRNA identity, it has been elucidated that 
unmodified tRNA may be aminoacylated like intact 
tRNA, and that the aminoacylated unmodified tRNA 
may be taken into ribosome, and translated (Shimizu, 
M. et al. (1 992) J. Mol. Evol. 35, 436-443). The identity 



of tRNA is also utilized in order to prepare sup tRNA. 
[0043] It has been reported that the aminoacyl syn- 
thetases of alanine, histidine, and leucine do not recog- 
nize the anticodons thereof (Tamura, K. et al. (1 991 ) J. 
5 Mol. Recog. 4, 129-132). Therefore, it can be expected 
that, by replacing the anticodon of tRNA for alanine with 
atermination codon (e.g., amber), tRNAfor alanine (sup 
tRNA) would be incorporated corresponding to the ter- 
mination codon. 

[0044] In this respect, it comes into question whether 
tRNA whose 5'-terminal-end side is not made up by 
RNAse P or the like, unlike ordinary tRNA, may enter 
into the A site of ribosome or not. This is the most im- 
portant problem to be investigated in determining feasi- 
bility of the model of the present invention. It has been 
known that the 3'-terminal ends of Brome Mosaic Virus 
(BMV) and Turnip Yellow Mosaic Virus (TYMV) have a 
tRNA-like structure, and they are aminoacylated by ami- 
noacyl synthetase, and incorporated at an efficiency of 
1% in a cell-free translation system (Chen, J. M. & Hall, 
T. C. (1973) Biochemistry 12, 4570-4574). Supposing 
that RNA of BMV is incorporated even by 1% by ribos- 
ome, it can be expected that RNA having intact tRNA at 
its 3'-terminal end may be incorporated more efficiently. 
Even if it is incorporated at an efficiency of 10% or less 
of that of intact tRNA. there is a reasonable possibility 
that it can win the competition with the release factor by 
the concentration effect. 

[0045] Therefore, before the experiment for bonding 
a protein to the 3'-terminal end of mRN A-sup tRNA (mR- 
NA ligated at its 3'-terminal end with sup tRNA through 
a spacer), it was examined whether even sup tRNA sep- 
arated from mRNA entered into the A site of ribosome 
and was bound to a protein. A sup tRNA whose 3'-ter- 
minal end was bonded to puromycin was actually pre- 
pared, and added to a cell-free protein synthesis system 
to examine whether the sup tRNA portion entered into 
the A site of ribosome corresponding to occurrence of 
a termination codon and bound to a protein. The 4 re- 
peats region of tau protein (127 residues) was used as 
mRNA (Goedert, M. (1989) EMBO J. 8, 392-399). As a 
result, when the translation was performed in a cell-free 
protein synthesis system, it could be confirmed that the 
sup tRNA having puromycin at its 3'-terminal end was 
incorporated into the A site of ribosome corresponding 
to atermination codon and bound to a protein (Figure 2). 
[0046] Then , RNA-sup tRNA bonded products having 
different lengths of the spacer between mRNA and sup 
tRNA were constructed, and it was attempted to select 
an optimum length of the spacer which afforded the best 
efficiency of the incorporation of the sup tRNA portion 
into the A site of ribosome by the in vitro selection meth- 
od. As a result, it was found that the RNA-sup tRNA 
bonded product having a certain spacer length was 
chemically bound to a protein that was a translation 
product thereof with a good efficiency. 
[0047] In order to construct the molecule assigning 
the genotype to the phenotype of the present invention, 
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a nucleoside or a substance having a chemical structure 
analogous to that of a nucleoside, which is to be bonded 
to the 3'-terminal end of the nucleic acid portion, and 
can becovalentlyboundtoan amino acid or a substance 
having a chemical structure analogous to that of an ami- 
no acid, e.g., 2'-deoxycytidylylpuromycin (dCpPur) and 
ribocytidylpuromycin (rCpPur) (Figure 3), must be syn- 
thesized first. 

[0048] An exemplary method for synthesizing dCpPur 
is as follows. First, puromycin-5 -monophosphate can 
be prepared by chemically phosphorylating the 5 '-hy- 
droxy I group of puromycin using phosphorus oxychlo- 
ride and trimethylphosphate. Then, the amino group of 
the amino acid portion and the 2 -hydroxyl group of the 
ribose portion in puromycin-5 -monophosphate can be 
protected by reacting puromycin-5 -monophosphate 
with trifluoroacetic acid and trifluoroacetic anhydride. 
The protected product can be reacted with Bz-DMT de- 
oxycytidine in which the amino group of the pyrimidine 
ring and the 5'-hydroxyl group of the ribose portion in 
deoxycytidine are protected, in the presence of a con- 
densation agent, dicyclohexylcarbodiimide, and then 
deprotected with acetic acid and ammonia to afford 2'- 
deoxycytidylylpuromycin (dCpPur). pdCpPurcan be ob- 
tained by phosphorylating the 5'-hydroxyl group of dCp- 
Pur with polynucleotide kinase. 

[0049] The ribocytidylpuromycin can be prepared by 
condensing puromycin and rC-p-amidite having protec- 
tive groups in the presence of tetrazole, and oxidizing 
and deprotecting the product. 

[0050] Then, the construction of a bonded product 
constituting the nucleic acid portion for binding the nu- 
cleic acid portion and the protein portion in a site-direct- 
ed manner will be described hereinafter. 
[0051 ] As the bonded product constituting the nucleic 
acid portion used for the site-directed method, for ex- 
ample, a bonded product comprising 5 -(T7 promoter re- 
gion)-(Shine-Dalgarno (SD) sequence region)-(mRNA 
region)-(spacer region)-(sup tRNA region)-(puromycin 
region)-3' connected in this order in sequence can be 
mentioned. 

[0052] In the construction of this bonded product for 
the nucleic acid portion, a plasmid comprising the 4 re- 
peats region, which is a microtuble-binding region of hu- 
man tau protein called htau24 (Goedert, M. (1989) EM- 
BO J. 8, 392-399), inserted downstream of T7 promoter 
(pAR3040) is constructed first, and it is digested with 
restriction enzymes BglW and BamH\ to afford a linear 
DNA. This DNA is used as a template, and amplification 
is carried out by PCR by using primers for upstream re- 
gion containing T7 region (forward) and for downstream 
region containing the SD region and a region around the 
initiation codon (backward), and Taq DNA polymerase. 
[0053] In the above method, three methionines may 
be added to the backward primer in order to enhance 
detection sensitivity for radioactive methionine in the 
protein portion after the protein synthesis. That is, leu- 
cine at position 4, and lysines at positions 5 and 8 of the 



4 repeats region are replaced with methionines. Even- 
tually, the translated 4 repeats protein contains four me- 
thionines in total. Then, amplification is carried out by 
PCR using a DNA containing the aforementioned line- 

5 arized 4 repeats region as a template, a complementary 
chain of the backward primer mentioned above as a for- 
ward primer, and a backward primer which is designed 
sothattheC-terminal end of the4 repeats region should 
have an amber codon as the termination codon. 

10 [0054] The two kinds of DNA fragments amplified by 
the PCR, namely, the DNA fragment containing the T7 
promoter and the SD region and the DNA fragment con- 
taining the 4 repeats region are mixed, initially extended 
without primers, and then amplified by PCR again by 

15 using a primer containing the sequence of T7 promoter 
as the forward primer, and a primer containing a termi- 
nation codon at the C-terminal of the 4 repeats region 
as the backward primer. 

[0055] This DNA bonded product (T7 promoter-SD-4 

20 repeats) is ligated to a double-stranded DNA fragment 
having cohesive ends at the both ends and composed 
of 17 residues in tandem by using DNA ligase to afford 
ligation products having different spacer lengths. 
[0056] After the ligation, the product was fractionated 

25 into three fractions (a, b, c) based on the length by poly- 
acrylamide gel electrophoresis (PAGE). The spacer is 
represented as (1 7)n wherein n = 1 5 to 1 8 forthe fraction 
a, n = 6 to 14 for the fraction b, and n = 0 to 5 for the 
fraction c. As the sup tRNA, a native alanyl tRNA whose 

30 several sites and anticodon are modified into amber 
(UAG) is prepared by chemical synthesis. This sup 
tRNA is ligated to the ligation products of the fractions 
a, b and c having different spacer lengths by using T4 
DNA ligase. Forthe ligation site, an excessive amount 

35 of a single-stranded backing DNA is used, and after 
once melted by temperature elevation, the strands are 
annealed, and a complementary strand is formed, and 
ligated. After the ligation, the ligation product is ampli- 
fied by PCR using primers for the 5 'end and 3 -terminal 

40 end of the ligation product. This DNA ligation product is 
transcribed by using T7 RNA polymerase to form an 
RNA ligation product. 

[0057] By ligating the pdCpPur chemically synthe- 
sized in the above to the 3 -terminal end of this RNA 

45 ligation product using T4 RNA ligase, there can be ob- 
tained an RNA ligation product, 5'-(T7 promoter region) 
-(SD region)-(4 repeats region)-(spacer region)-(sup 
tRNA region)-(puromycin)-3', which can be used as a 
gene in a cell-free protein synthesis system. 

50 [0058] The protein synthesis is performed by adding 
the above RNA ligation product as mRNAto a cell-free 
protein synthesis system such as cell-free protein syn- 
thesis extracts of E. coliov rabbit reticulocytes. In order 
to obtain an optimum spacer length for obtaining the 

55 most efficient bonding of the nucleic acid portion (RNA) 
and the protein portion, the following experiment is per- 
formed. 

[0059] That is, the protein synthesis is performed in a 
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cell-free protein synthesis system by using aforemen- 
tioned RNA ligation products having three kinds of dif- 
ferent spacer lengths, corresponding to the fractions a, 
b and c, as the gene. In this synthesis, by adding tRNA 
charged with modified lysine comprising biotin bound 
through the e-amino group of the lysine, biotinyl lysine is 
incorporated in several positions of lysine residues in 
the translated 4 repeats protein. After the protein syn- 
thesis, magnetic beads coated with streptavidin on their 
surfaces are added to isolate the protein incorporating 
the biotin. 

[0060] If the nucleic acid portion (RNA) has bonded 
to the protein portion through puromycin, the nucleic ac- 
id portion (RNA) should be bound to the C-terminal end 
of the protein. When reverse transcription was per- 
formed by using a sequence corresponding to the N- 
terminal region of the 4 repeats as a forward primer and 
the 3'-terminal end portion of sup tRNA as a backward 
primer and analyzed by polyacrylamide gel electro- 
phoresis to confirm whether the RNA-protein-bonded 
product was actually picked up by the magnetic beads, 
a band of reverse transcribed DNA was observed only 
for the spacer length of the fraction c. This means that 
the RNA ligation product having the spacer length of the 
fraction c is most efficiently bound to the protein portion. 
[0061] Now, it will be explained about (2) the method 
where the bonding of the nucleic acid portion and the 
protein portion is formed in a non-site-directed manner. 
[0062] In this method, a molecule assigning a geno- 
type to a phenotype can be constructed by (a) preparing 
a DNA containing a gene which has no termination co- 
don, (b) transcribing the prepared DNA into RNA, (c) 
bonding a chimeric spacer composed of DNA and RNA 
to the 3'-terminal end of the obtained RNA. (d) bonding, 
to the 3'-terminal end of the bonded product, a nucleo- 
side or a substance having a chemical structure analo- 
gous to that of a nucleoside, which can be covalently 
bound to an amino acid or a substance having a chem- 
ical structure analogous to that of an amino acid, e.g., 
puromycin, (e) performing protein synthesis in a cell- 
free protein synthesis system using the obtained bond- 
ed product as mRNA, and thus (f) affording a molecule 
assigning a genotype to a phenotype comprising a gene 
RNA and a protein which is a translation product of the 
gene, which are chemically bound through puromycin 
or the like. 

[0063] That is, according to this method of the present 
invention, a nucleoside or a substance having a chem- 
ical structure analogous to that of a nucleoside, e.g., 
puromycin, present at the 3'-terminal end of the nucleic 
acid portion does not enter into the A site of ribosome 
corresponding to the termination codon of mRNA on ri- 
bosome, but randomly enters depending on the spacer 
length, and puromycin or the like at the 3'-terminal end 
of the RNA-DNA chimera nucleic acid portion is chem- 
ically bound to a protein by the action of peptidyl trans- 
ferase (Figure 4). Therefore, this method is non-site-di- 
rected as for the formation of the bonding between the 



nucleic acid portion and the protein, which does not de- 
pend on the genetic code. 

[0064] In this method, the molecule assigning the 
genotype to the phenotype can be constructed by using 
5 a non-site-directed ligation product for the nucleic acid 
portion in the same manner as in the aforementioned 
site-directed method of (1). 

[0065] As the ligation product constituting the nucleic 
acid portion used for the n on -site- directed method, for 
10 example, a ligation product composed of 5'-(T7 promot- 
er region)-(Shine-Dalgarno(SD) sequence region)-(mR- 
NA region)-(spacer region)-(puromycin region)-3' con- 
nected in this order in sequence can be mentioned. 
[0066] In the construction of this ligation product for 
15 the nucleic acid portion, the construction from the T7 
promoter region to the end of the 4 repeats region may 
be similar to that explained for the construction of the 
ligation product for the nucleic acid portion used in (1) 
the site directed method mentioned above, provided 
20 that a primer designed not to have a termination codon 
by replacing the two termination codons at the C-termi- 
nal end of the 4 repeats, ochre (CTG) and amber (TAA), 
with CAG (glutamine) and AAA (lysine), respectively, is 
used as a backward primer for the PCR amplification of 
25 the ligation product constructed above used as a tem- 
plate. 

[0067] This DNA ligation product is transcribed as a 
template by using T7 RNA polymerase to afford a cor- 
responding RNA ligation product. This single-stranded 
30 RNA ligation product is separately ligated to each of sin- 
gle-stranded chemically-synthesized DNA linkers 
(chain length; 20, 40, 60, and 80 nucleotides) by using 
T4 RNA ligase. Then, each ligation product is ligated to 
a single-stranded DNA-RNA chimeric oligonucleotide 
35 comprising 25 residues (DNA; 21 residues, RNA; 4 res- 
idues), which is designated as peptide acceptor, by us- 
ing T4 DNA ligase in the presence of a single-stranded 
backing DNA. 

[0068] Because the sequence of the peptide acceptor 
40 contains the 3'-terminal end sequence of alanyl tRNA, 
and it enhances the incorporation of a puromycin deriv- 
ative into the A site of ribosome, it is preferable to use 
the peptide acceptor between the spacer region and the 
puromycin region. 
45 [0069] By ligating the pdCpPur chemically synthe- 
sized in the above to the 3'-terminal end of the above 
ligation product using T4 RNA ligase, there can be ob- 
tained an RNA-DNA chimeric ligation product, 5'-(T7 
promoter region (RNA))-(SD region (RNA))-(4 repeats 
50 region (RNA))-(spacer region (DNA))-(peptide acceptor 
region)-(puromycin)-3\ which can be used as a gene for 
a cell-free protein synthesis system. 
[0070] If protein synthesis is performed by using the 
RNA-DNA chimeric ligation product mentioned above 
55 as a gene in a cell-free protein synthesis system, there 
can be obtained a bonded product comprising a nucleic 
acid portion (RNA-DNA chimeric ligation product, gen- 
otype) and a protein portion (phenotype), which are con- 
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nected by a chemical bond through puromycin. 
[0071] In the aforementioned method, a chimeric 
spacer of DNA and polyethylene glycol can also be used 
instead of the chimeric spacer of DNA and RNA. 
[0072] In the above method, a spacer composed of a 
double-stranded chain of DNA and DNA, or a double- 
stranded chain of RNA and short chain PNA or DNA (e. 
g. , about 1 5 to 25 nucleotides) may also be used instead 
of the chimeric spacer of DNA and RNA. The spacer 
composed of the double strand of DNA and DNA is not 
necessary to be double-stranded in its full length, and it 
may be double-stranded for most part (usually, several 
residues at the both ends are single-stranded, and re- 
maining portion is double-stranded). The double-strand- 
ed spacer composed of RNA and short chain PNA or 
DNA can also be prepared by (a) preparing a DNA con- 
taining a gene which has no termination codon, and a 
nucleotide sequence of a spacer, (b) transcribing the 
prepared DNA into RNA, (c) bonding, to the 3'-terminal 
end of the obtained RNA, a nucleoside or a substance 
having a chemical structure analogous to that of a nu- 
cleoside, which can be covalently bound to an amino 
acid or a substance having a chemical structure analo- 
gous to that of an amino acid, and (d) adding a short 
chain PNA or DNA to a 3'-terminal-end side portion of 
the gene in the obtained RNA bonded product to form 
a double-stranded chain. 

[0073] The genetic engineering techniques men- 
tioned in the present specification such as isolation and 
preparation of nucleic acids, ligation of nucleic acids, 
synthesis of nucleic acids, PCR, construction of plas- 
mids, and translation in cell-free system can be per- 
formed by the methods described in Sambrook et al. 
(1 989) Molecular Cloning, 2nd Edition, Cold Spring Har- 
bor Laboratory Press, or similar methods unless other- 
wise indicated. 

[0074] The assigning molecule of the present inven- 
tion can also be obtained by successively bonding each 
of the elements by any known chemical bonding meth- 
ods in addition to the methods exemplified above. 
[0075] The protein evolution simulation method of the 
present invention is a method comprising steps of (1) 
construction of in vitro virus genomes, (2) completion of 
in vitro viruses, (3) selection process, (4) introduction of 
mutation, and (5) amplification, as shown in Figure 12. 
These steps or repetition of these steps as required al- 
lows modification and creation of functional proteins. 
Among these steps, the steps of (1) and (2) can be per- 
formed by the construction methods explained above in 
detail. That is, the step (1) corresponds to construction 
of the bonded product comprising the nucleoside or the 
substance having the chemical structure analogous to 
that of the nucleoside, and the step (2) corresponds to 
the construction of the assigning molecule from the 
bonded product. The steps of (3), (4) and (5) will be de- 
scribed hereinafter. 

[0076] The selection process of (3) means a process 
of evaluating function (biological activity) of protein por- 



tions constituting the in vitro viruses, and selecting in 
vitro viruses based on a desired biological activity. Such 
a process has been known, and described in, for exam- 
ple, Scott, J. K. & Smith, G. P. (1990) Science, 249, 

5 386-390; Devlin, P. E. et al. (1990) Science, 249, 
404-406; Mattheakis, L. C. etal. (1994) Proc. Natl. Acad. 
Sci. USA, 91 , 9022-9026 and the like. 
[0077] Then, mutations are introduced into the nucleic 
acid portions of the selected in vitro viruses, and the in 

10 vitro viruses are amplified by PCR orthe like in the steps 
of (4) introduction of mutation, and (5) amplification. 
When the nucleic acid portion of in vitro viruses is com- 
posed of RNA, mutation can be introduced after a cDNA 
is synthesized by reverse transcriptase. The amplifica- 

15 tion of the nucleic acid portion may also be performed 
while introducing mutation. The introduction of mutation 
can be readily performed by already established error- 
prone PCR (Leung, D. W., et al., (1 989) J. Methods Cell 
Mol. Biol., 1, 11-15), Sexual PCR (Stemmer, W. P. C. 

20 (1 994) Proc. Natl. Acad. Sci. USA 91 , 1 0747-1 0751 ) or 
the like. 

[0078] (1) In vitro virus genomes can be constructed 
by using nucleic acid portions for in vitro viruses which 
have been introduced with mutation and amplified, (2) 

25 in vitro viruses can be completed by using the in vitro 
virus genomes, (3) in vitro viruses can be selected 
based on a desired biological activity, and (4) mutation 
introduction and amplification can be carried out. By re- 
peating these steps as required, modification and crea- 

30 tion of functional proteins can be realized. 

[0079] The means contained in the apparatus of the 
present invention for performing the aforementioned 
protein evolution simulation method themselves are 
known ones, and operations in these means such as 

35 addition of reagents, stirring, temperature control, and 
evaluation of biological activity can be performed ac- 
cording to the methods known per se. By combining 
these operations, an automatic or semi-automatic ap- 
paratus of the present invention can be constructed. 

40 [0080] The step of constructing assigning molecules 
in the method for assaying protein/protein or protein/nu- 
cleic acid intermolecular action of the present invention 
generally comprises steps of (1) synthesizing mRNA 
from a gene library or a cDNA library, and constructing 

45 an in vitro genome, and (2) constructing an in vitro virus 
comprising mRNA and a corresponding protein, which 
are bonded on ribosome, by utilizing a cell-free protein 
synthesis system. 

[0081] The step (1 ) corresponds to synthesis of mR- 
50 NA using RNA polymerase from cDNA of DNA of which 
sequence has been known and which contains a se- 
quence corresponding to ORF, cDNA of DNA of which 
sequence is unknown and which contains a fragment 
resulting from fragmentation with a suitable restriction 
55 enzyme orthe like, and construction of an in vitro virus 
genome by utilizing the mRNA. 

[0082] The steps of above (1) in vitro virus genome 
construction, and (2) in vitro virus construction can be 
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performed by the construction methods explained 
above in detail. 

[0083] The assay step for examining intermolecular 
action between assigning molecules and other proteins 
or nucleic acids (DNA or RNA) usually comprises steps 
of (3) selecting only proteins having a particularfunction 
from the in vitro viruses constructed in the step (2), and 
(4) subjecting selected in vitro viruses to reverse tran- 
scription, amplification, and sequencing. 
[0084] In the step (3), target proteins or nucleic acids 
(DNA or RNA) and other substances, for example, sac- 
charides and lipids, are bound to a microplate, beads or 
the like beforehand through covalent bonds or non-cov- 
alent bonds, and the in vitro viruses constructed in the 
step (2) are added thereto, to react under a certain tem- 
perature condition for a certain period of time, and it is 
washed to remove in vitro viruses which has not been 
boundto the target. Then, the in vitro viruses which have 
been bound to the target are released. This step can be 
performed by the already-established ELISA (Enzyme 
Linked Immunosorbent Assay, Crowther, J. R. (1995) 
Methods in Molecular Biology, Vol. 42, Humana Press 
Inc.) or a similartechnique. 

[0085] In the step (4), the in vitro viruses released in 
the step (3) are reverse-transcribed and amplified by re- 
verse transcription PCR, and the amplified DNA was se- 
quenced directly or after cloning. 
[0086] According to the assay method of the present 
invention, it becomes possible to identify a function of a 
gene product (protein) corresponding to a gene whose 
function is unknown by (1) synthesizing mRNA from 
gene DNA whose sequence is known or unknown to 
construct in vitro virus genomes, (2) constructing in vitro 
viruses by using the in vitro virus genomes, (3) selecting 
only those binding to a target protein or nucleic acid or 
other substances, for example, saccharides and lipids 
from among the in vitro viruses, and (4) subjecting se- 
lected in vitro viruses to reverse transcription, amplifica- 
tion, cloning and sequencing. 

[0087] To perform the method for assaying intermo- 
lecular action mentioned above, an apparatus can be 
constructed by combining known appropriate means. 
The means contained in the apparatus may be per se 
known ones, and operations in these means such as 
addition of reagents, stirring, temperature control, and 
evaluation of biological activity can be performed ac- 
cording to the methods known per se. By combining 
these operations, an automatic or semi-automatic ap- 
paratus for assaying intermolecular action can be con- 
structed. 

Examples 

[0088] The present invention will be more specifically 
explained with reference to the following examples. 
However, the following examples should be construed 
to be an aid for obtaining more specific understanding 
of the present invention, and the scope of the present 



invention is not no way limited by these examples. 

Example 1 : Preparation of in vitro virus (1) 

5 <1> Preparation of 3'-terminal-end portion of nucleic 
acid portion 

(a) Synthesis of phosphorylated puromycin (pPur) 
Materials: 

10 

[0089] Puromycin (3'-[cx-amino-p-methoxyhydrocin- 
namamido]-3'-deoxy-/V,/V-dimethyl-adenosine) was 
purchased from Sigma. Phosphorus oxychloride, and 
trimethyl phosphate were purchased from Wako Pure 
15 Chemicals. 

Methods: 

[0090] A solution formed by mixing phosphorus oxy- 
20 chloride (1.5 mmol) and trimethyl phosphate (11.4 
mmol) was ice-cooled, and puromycin (0.3 mmol) was 
added thereto to mix sufficiently, and the mixture was 
allowed to react at 0°Cfor 7 hours (Yoshikawa, M. et al. 
(1969) Bull. Chem. Soc. Jap. 42, 3505-3508). The re- 
25 action mixture was then added to an ice-cooled mixture 
of acetone (40 ml) and ether (20 ml) containing sodium 
perchlorate (NaCI0 4 , 0.4 g), and stirred sufficiently. 
Then, water (720 ml) was added to the mixture, and the 
mixture was stirred at 4°C for 24 hours to hydrolyze the 
30 chlorine group. The product precipitated after the hy- 
drolysis was separated by centrifugation, and washed 
with acetone and ether. The resulting white powder was 
dried in vacuo to afford phosphorylated puromycin with 
a yield of 70 to 90% based on the puromycin. 

35 

(b) Protection of phosphorylated puromycin by 
acetylation Materials: 

[0091] Trifluoroacetic acid (TFA) was purchased from 
40 Nacalai Tesque. Trifluoroacetic anhydride (TFAA) was 
purchased from Wako Pure Chemicals. 

Methods: 

45 [0092] The dried phosphorylated puromycin (0.2 
mmol) and TFA (5 ml) were mixed, and TFAA (2 ml) was 
added to the mixture at -1 0°C, followed by stirring. The 
mixture was allowed to react at room temperature for 1 
hour with stirring (Weygand, F. & Gieger, R. (1956) 
50 Chem. Ber. 89, 647-652). The reaction was quenched 
by adding water (50 ml), and the TFA was removed by 
repeating a procedure comprising addition of water (10 
ml) and evaporation to dryness under reduced pressure 
5 times. Finally, water (50 ml) was added to the resulting 
55 product, followed by lyophilization to afford phosphor- 
ylated puromycin in which the amino group of the amino 
acid portion in puromycin and the 2 -hydroxyl group of 
the ribose portion were protected with acetyl groups with 
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a yield of 50 to 60% based on the phosphorylated puro- 
mycin. 

(c) Synthesis of dCpPur (2 , -deoxycytidyl(3'^5 l ) 
puromycin) Materials: 5 

[0093] BZ-DMT deoxycytidine (A^-benzoyl^ 1 - 
0-(4,4'-dimethoxytrityl)-2 -deoxycytidine) was pur- 
chased from Sigma, and DCC (dicyclohexyl carbodiim- 
ide) was purchased from Watanabe Chemical. The py- 10 
ridine was purchased from Nacalai Tesque. 

Methods: 

[0094] The phosphorylated puromycin protected with is 
acetyl groups (40 jamol) and Bz-DMT deoxycytidine 
(600 jxmol) were dehydrated by repeating a procedure 
comprising addition of pyridine (2 ml) and evaporation 
to dryness three times, and finally pyridine (2 ml) was 
added thereto. DCC (400 jxmol) was added to the mix- 20 
ture with stirring, and the mixture was allowed to react 
at room temperature for 3 days to 2 weeks (Ralph, R. 
K. et al. (1965) J. Am. Chem. Soc. 87, 5661-5670; and 
Harris, R. J. et al. (1 972) Can. J. Biochem. 50, 91 8-926). 
After the reaction , the DMT group was removed by a 25 
reaction with 80% acetic acid (5 ml)fortwo hours. Then, 
the acetyl group was removed by a reaction with con- 
centrated aqueous ammonia/ethanol (6 ml, volume ra- 
tio: 2:1) at 20° C for 2 days. The concentrated aqueous 
ammonia was removed by evaporation under reduced 30 
pressure, and the residue was dissolved in water (40 
ml). The resulting solution was applied on a column 
packed with QAE-Sephadex A-25 (Pharmacia) and ad- 
sorbed thereon. Fractions containing the target product 
were eluted with 0.5 M triethylamine carbonate (TEAB, 35 
pH 7.5), lyophilized ; and finally separated by HPLC to 
afford deprotected dCpPur with a yield of 1 to 5% based 
on the puromycin. 

<2> Preparation of nucleic acid portion (in vitro virus 40 
genome) 

[0095] Two kinds of in vitro virus genomes, i.e.. (1) 
one for bonding a nucleic acid portion and a protein por- 
tion in a site-directed manner, and (2) one for bonding 45 
a nucleic acid portion and a protein portion in a non-site- 
directed manner, were prepared. 

Materials: 

50 

[0096] An E. co//cell-free protein synthesis system (E. 
colt S30 Extract System for Linear Templates) was pur- 
chased from Promega. T7 RNA polymerase, T4 DNA 
ligase, T4 DNA kinase, human placenta ribonuclease 
inhibitor, EcoRI, Bam\-\\, and deoxyribonucleotides were 55 
purchased from Takara Shuzo. Restriction enzymes 
BstN\ and Bgf\\ were purchased from New England 
Labs. As for [ 35 S]-methionine, and [y- 32 P]-ATP, those 



from Amersham, andasforTaq DNA polymerase, those 
from Kurabo and Grainer were used. As for the other 
biochemical reagents, those from Sigma and Wako 
Pure Chemicals were used. A plasmid containing the 
microtuble-binding region of human tau protein (4 re- 
peats) (pAR3040) was prepared by picking up the full 
length gene of human tau protein from a cDNA library 
of human brain cloned in X ZAPII by PCR, introducing 
the gene into a plasmid : amplifying only the 4 repeats 
region in the plasmid, and introducing the amplified 
product into a plasmid. As the PCR (polymerase chain 
reaction) apparatuses, Model PTC-100 (MJ Research) 
and Model ASTEC PC800 (Astec) were used. 

(1 ) Preparation of genome for site-directed bonding 

A. Preparation of DNA for mutated 4 repeats portion 

[0097] 

1) A plasmid (pAR3040) comprising microtuble re- 
gion (4 repeats) of human tau protein (Goedert, M. 
(1989) EMBO J. 8, 392-399) was constructed, and 
linearized by digestion with restriction enzymes 
Bgl\\ and BamH\. 

2) The 4 repeats portion containing the T7 promoter 
region and the Shine-Dalgarno sequence was am- 
plified by PCR from the genome prepared above. 
For this amplification, as primers, Left+ (SEQ ID 
NO: 1) was used for 5' side, and Right-(SEQ ID NO: 

2) for the 3' side. Right- had such a sequence that 
the leucine before the ochre termination codon 
should be mutated into amber termination codon. 
The PCR conditions were 92°C/30 seconds for de- 
naturation, 65°C/30 seconds for annealing, and 
73°C/1 minute for elongation, and this cycle was re- 
peated 30 times. 

3) Then, the amplified genome was purified, and 
mutated by utilizing PCR in order to promote the in- 
corporation of methionine and hence enhance de- 
tection of radioactive isotope. That is, primers Left- 
(SEQ ID NO: 3), and Right+ (SEQ ID NO: 4) con- 
taining a region desired to be mutated were synthe- 
sized. First, using the DNA of the above 2) as a tem- 
plate, it was amplified by PCR with primers Left+ 
and Left-, and the amplified DNA was designated 
as "Left". Amplification by PCR was also performed 
with primers Right+, and Right-, and the amplified 
DNA was designated as "Right". After 5% polyacr- 
ylamide denatured gel electrophoresis, "Left" and 
"Right" were excised form the gel, and extracted. 
The excised Left and Right were first amplified by 
PCR under the same conditions as mentioned 
above without primers. Further. 1 jxl taken from this 
reaction mixture was used as a template, and it was 
amplified by PCR under the same conditions with 
primers Left-h and Right-. From the above proce- 
dure, DNA of the mutated 4 repeats portion in which 
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the number of methionine was increased from one 
to four was prepared. 

B. Ligation of alanine suppressor tRN A (Ala-sup tRNA) 
containing spacers having different lengths to 4 repeats 
portion 

[0098] 

1 ) The BamW site located in the 3'-terminal-end se- 
quence of the above 4 repeats portion obtained in 
the above A was digested with BamH~\ . Then, to re- 
move 3'-terminal-end fragment at the BamW site, 
only the 4 repeats portion in 5 -terminal-end se- 
quence was extracted and purified by using 
QIAquick PCR Purification Kit (QIAGEN). 

2) The purified product of the above 1), and Spacer- 
A (SEQ ID NO: 5) which was phosphorylated at 5 - 
terminal end by T4 kinase were ligated by using T4 
DNA ligase while they were backed with Spacer-B 
(SEQ ID NO: 6). 

3) Spacer-C (SEQ ID NO: 7) which was phosphor- 
ylated by T4 kinase, and Spacer-B which had a re- 
gion complementary to Spacer-C were ligated by 
using T4 DNA ligase. The reaction was performed 
at 15°Cfor2 hours. Then, the product was purified 
by ethanol precipitation. 

4) The products of the above sections 2) and 3), 
Spacer-D (SEQ ID NO: 8), and sup tRNA (SEQ ID 
NO: 9) which was phosphorylated at 5'-terminal end 
were dissolved in T4 DNA ligase buffer, denatured 
at 85°C for 2 minutes : and cooled on ice. After ad- 
dition of T4 DNA ligase, it was allowed to react at 
15°C for 2 hours, and subjected to phenol extrac- 
tion, and ethanol precipitation. 

5) The product obtained in the above 4) was used 
as a template, and amplification was carried out by 
PCR using the primer Lef t+ , and a primer 3'Pur- 
(SEQ ID NO: 10) under the conditions of 92°C/30 
seconds for denaturation, 65°C/30 seconds for an- 
nealing, and 73°C/1 minute for elongation, which 
cycle was repeated 30 times. The product was sub- 
jected to polyacrylamide denatured gel electro- 
phoresis. Three regions A, B and C exhibiting dif- 
ferent migration distances were excised, and DNA 
was extracted from the regions. 

6) The DNAs extracted from A, B and C in the above 
5) and having different lengths were used as tem- 
plates and amplification were again carried out by 
PCR under the same conditions, and lengths of the 
products were determined by electrophoresis, and 
they were used as template DNA for transcription. 
As a result, it was found that the numbers of Spac- 
er-C inserted into each product of the fractions were 
0-5 for the fraction c, 6-14 for the fraction b, and 
15-18 for the fraction a. 



C. Preparation of RNA genome and ligation of dCpPur 

[0099] The regions A ; B and C obtained in the above 
B was transcribed into RNA at 37° for 2 hours by using 

5 T7 polymerase. Further, the dCpPur obtained in the 
above <1 > Preparation of 3'-terminal-end portion of the 
nucleic acid portion was phosphorylated in the presence 
of ATP by using T4 polynucleotide kinase at 1 5°C for 24 
hours, and ligated to the aforementioned transcribed 

10 RNA genome by using T4 RNA ligase at 4°C for 50 
hours. From this procedure, an RNA genome compris- 
ing sup tRNA having puromycin at its 3'-terminal end 
could be constructed. 

15 (2) Preparation of genomefor non-site-directed bonding 

A. Preparation of DNA and RNA for mutated 4 repeats 
portion 

20 [01 00] DNA of the mutated 4 repeats portion was pre- 
pared principally the same method as the aforemen- 
tioned (1) A. However, the termination codons were 
eliminated by changing the two termination codons, am- 
ber and ochre, to glutamine and lysine, respectively, and 
25 a new primer New/Right- (SEQ ID NO: 1 0) was synthe- 
sized in order to make the 3'-terminal-end sequence pu- 
rine-rich , and used with Left+ for PCR amplification . The 
amplification by PCR was performed under the condi- 
tions of 92°C/30 seconds for denaturation , 65°C/30 see- 
so onds for annealing, and 73°C/1 minute for elongation, 
which cycle was repeated 30 times. This DNA was used 
as a template to obtain an RNA genome through a re- 
action at 37°Cfor2 hours utilizing T7 polymerase. 

35 B. Ligation of Spacers 1 to 4 

[0101] After a reaction at 36°C for 1 hourwithT4 poly- 
nucleotide kinase, of Spacer 1 which was a DNA com- 
posed of 21 nucleotides (SEQ ID NO: 11), Spacer 2 

40 which was a DNA composed of 40 nucleotides (SEQ ID 
NO: 12). Spacer 3 which was a DNA composed of 60 
nucleotides (SEQ ID NO: 13), or Spacer 4 which was a 
DNA composed of 80 nucleotides (SEQ ID NO: 14), the 
RNA obtained in the above A was ligated to the spacers 

45 through a reaction at 10°C for 48 hours using T4 RNA 
ligase. 

C. Ligation of peptide acceptor (P -Acceptor) 

50 [0102] A peptide acceptor (P-Acceptor, SEQ ID NO: 
15), which was a chimeric nucleic acid composed of 
21 -nucleotide DNA and 4-nucleotide RNA, i.e., 25 nu- 
cleotides in total, was synthesized in order to enhance 
the incorporation efficiency into ribosomes by ligating it 

55 at its 3'-terminal end to dCpPur. To phosphorylate the 
5'-terminal end of P-Acceptor, it was reacted at 36° C for 
1 hour using T4 polynucleotide kinase. Then, the prod- 
uct was backed with Back3' (SEQ ID NO: 1 6) having a 
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complementary sequence thereto, and ligated to the 3'- 
terminal end of each of the spacers prepared in the 
aforementioned B through a reaction at 1 6°Cfor2 hours 
using T4 RNA ligase. This P-Acceptor was also directly 
ligated to the 3'-terminal end of the RNA obtained in the 5 
above A through a reaction at 1 0°C for 48 hours using 
T4 RNA ligase ; and the product was designated as Non- 
Spacer genome. 

D. Ligation of dCpPur 

[0103] The dCpPur obtained in the above <1 > Prep- 
aration of 3'-terminal-end portion of nucleic acid portion 
was phosphorylated by using T4 polynucleotide kinase 
at 15°C for 24 hours, and ligated to the 3'-terminal end 
of each of the genomes prepared in the above C by us- 
ing T4 RNA ligase at 4°C for 50 hours. By this proce- 
dure, chimeric RNA genomes comprising puromycin at 
its 3'-terminal end could be constructed. 

<3> Optimization of nucleic acid portion 

A. Site-directed method 

[0104] Each of the RNA genomes prepared in the 
above <2>, (1), which were classified into each of the 
lengths corresponding to the fractions a, b and c, was 
translated in 50 \i\ of E. coli eel I -free translation system 
[E. coli S30 Extract Systems for Linear Templates 
(Promega)] containing biotinylated lysyl tRNA (Prome- 
ga), and after addition of 5 mg of streptavidin coated 
magnetic beads (Dynabeads, Dynal) to each reaction 
tube, it was incubated at room temperature for 1 hour. 
Then, the Dynabeads were collected by a magnet, and 
the supernatant was aspirated. The remained Dyna- 
beads were washed 2 times with B&W buffer (1 000 uJ). 
The beads were further washed twice with RT-PCR buff- 
er (500 ^l), and resuspended in RT-PCR buffer (500 jal). 
The suspension (50 was transferred into a 500-ul Ep- 
pendorf tube, and after the Dynabeads were immobi- 
lized with a magnet, the supernatant was aspirated. To 
the remained Dynabeads, RT-PCR buffer, reverse tran- 
scriptase, and Taq polymerase [Access RT-PCR Sys- 
tem (Promega)] were added. Reverse transcription was 
performed at48°Cfor 1 hours, and PCR was performed 
under the conditions of 94°C/30 seconds, 65°C/40 sec- 
onds, and 68°C/1 minute and 40 seconds, which cycle 
was repeated 40 times, using primers of Right+ (SEQ 
ID NO: 4) and 3'Pur- (SEQ ID NO: 10). The results of 
the analysis of the fractions a, b and c by electrophoresis 
were shown is Figure 5. 

[0105] A band was detected from the group of the 
fraction c (Lane 3 in Figure 5). This band was separated 
from the gel by electrophoresis, ligated to "Left" having 
theT7 promoter and the Shine-Dalgarno region by PCR, 
and amplified by PCR using the primers Left+ (SEQ ID 
NO: 3) and 3'Pur- (SEQ ID NO: 10). This genome was 
designated as "Stranger". 



[0106] Then, to examine whether the protein actually 
translated from Stranger was bonded to the mRNA por- 
tion (RNA genome portion), after transcription, it was 
ligated with pdCpPurat its 3'-terminal end using T4 RNA 
ligase, dephosphorylated at 5'-terminal end of RNA us- 
ing HK phosphatase (Epicentre) at 30°C for 1 hour, and 
labeled with [y- 32 P]-ATP using T4 polynucleotide ki- 
nase. The product was added to an E. coli cell-free 
translation system as mRNA, and allowed to react at 
37°C for 1 hour and 40 minutes. The results of 18% 
SDS-PAGE of the product are shown in Figure 6. From 
the results, it can be seen that the nucleic acid portion 
(genotype) and the protein portion (phenotype) were 
bonded to form in vitro viruses, i.e., molecules that as- 
sign a genotype to a phenotype, at a rate of about 80% 
or more. 

B. N on -site-directed method 

[0107] Because a short spacer was already used in 
the site-directed method, dCpPur, which had been 
phosphorylated at 5 -terminal end in the presence of 
[y- 32 P]-ATP using T4 polynucleotide kinase, was ligated 
to the 3 -terminal end of "Non-spacer" RNA genome 
without a spacerthrough a reaction using T4 RNA ligase 
at 4° C for 50 hours. The product was added to an E. coli 
cell-free translation system together with mRNA encod- 
ing the ordinary 4 repeats, and allowed to react at 37°C 
for 1 hour and 30 minutes. This reaction mixture (10 
digested with ribonuclease T2, and an equal amount of 
the reaction mixture were electrophoresed by 18% 
SDS-PAGE, and analyzed by an image analyzer 
BAS2000 (Fujifilm) (Figure 7). 

[0108] As a result, as for the sample digested with ri- 
bonuclease T2, a band appeared at the same migration 
distance as that of the control, the 4 repeats labeled with 
[ 35 S]-methionine, because the sample contained the re- 
leased protein portion. On the other hand, as for the 
sample not treated, a band appeared above the 4 re- 
peats protein, i.e., it was found that the band reflected 
a clearly larger molecular weight. This band did not cor- 
respond to the labeled mRNA itself (about 400 nucle- 
otides), because it migrated a longer distance than the 
tRNA. Therefore, it was identified as a substance com- 
posed of bonded RNA and protein. That is, these results 
demonstrated that the nucleic acid portion was bonded 
to the protein portion in a non-site-directed manner. 

Example 2: Preparation of in vitro virus (2) 

<1> Preparation of 3'-terminal-end portion of nucleic 
acid portion 

(a) Synthesis of rCpPur (ribocytidylp'^S^puromycin 
Materials: 

[0109] Each material was purchased from the follow- 
ing manufacturers: Puromycin from Sigma, rC-0- 
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amidite (A/ 4 -benzoyl-5'-0-(4,4 , -dimethoxytrityl)-2 , -0- 
tert-butyldimethylsilyl)-cytidine-3'-0-[0-(2-cyanoethyl) 
-/V,/V-diisopropyl-phosphoramidite]) from Japan 
PerSeptive, tetrazole from Japan Millipore, tetrabuty- 
lammonium fluoride from Aldrich, QAE-Sephadex from 
Pharmacia, and silica gel for chromatography from Mer- 
ck. 

Methods: 

[0110] Puromycin (50 mg ; 92 ujtioI) was dissolved in 
dry pyridine (2 ml), and dehydrated by evaporation un- 
der reduced pressure. This procedure was repeated 
three times. To this, 4% tetrazole solution in acetonitrile 
(1 5 ml) was added, and the mixture was stirred at room 
temperature. The reaction was monitored by silica gel 
thin layer, chromatography (TLC, developing solvent: 
chloroform:methanol = 9:1). The reaction was usually 
finished in a day. After the reaction, the solvent was re- 
moved under reduced pressure. To the residue, 0.1 M 
solution of iodine in tetrahydrofuran/pyri dine/water (80: 
40:2, 3 ml) was added, and the formed phosphite triester 
was oxidized with stirring at room temperature. One and 
a half hours later, the solvent was removed under re- 
duced pressure, and the residue was extracted with 
chloroform. The extract was dried over anhydrous mag- 
nesium sulfate, and the solvent was removed under re- 
duced pressure. The residue was subjected to silica gel 
column chromatography, and eluted with chloroform/ 
methanol = 90:10. The ribocytidylpuromycin (CpPur) 
having protection groups was eluted by silica gel TLC 
(developing solvent; chloroform:methanol = 9:1) at an 
Rf of 0.32. Then, protection groups were removed. The 
ribocytidylpuromycin having protecting groups was first 
treated with 80% aqueous solution of acetic acid (0.5 
ml) at room temperature for 1 hour. After the acetic acid 
was removed under reduced pressure, to the residue, 
a mixed solution of aqueous ammonia/ethanol = 2:1 (0.5 
ml) was added. After the mixture was left at room tem- 
perature for 15 hours, the solvent was removed under 
reduced pressure, and to the residue, 1 M solution of 
tetrabutylammonium fluoride in tetrahydrofuran (0.5 ml) 
was added to remove p-cyanoethyl group. Thirty min- 
utes later, the solvent was removed under reduced pres- 
sure, and the residue was subjected to column chroma- 
tography using QAE-Sephadex, and eluted with 0 to 0.5 
M linear gradient of triethylamine carbonate. The eluent 
was collected and lyophilized to afford 1 0 mg of ribocyti- 
dylpuromycin. The synthesized product was confirmed 
to be ribocytidylpuromycin by the facts that it afforded 
equimolar amounts of cytidine and puromycin-5'-phos- 
phate after digestion with nuclease P1 , and that molec- 
ular ions of [M+H]+ were appeared at m/z 777 in MALDI/ 
TOF mass spectrometry. 



<2> Nucleic acid portion (preparation of in vitro virus 
genome) 

Materials: 

5 

[0111] A cell-free protein synthesis system of rabbit 
reticulocyte lysate (Nuclease treated Rabbit reticulocyte 
lysate) was purchased from Promega. T7 RNA polymer- 
ase, T4 DNA ligase, T4 RNA ligase, T4 polynucleotide 

10 kinase, human placenta ribonuclease inhibitor, EcoRL 
BamH\, and deoxyribonucleotides were purchased from 
Takara Shuzo. Restriction enzymes Bs/NI, and BglW 
were purchased from New England Labs. As for [ 35 S]- 
methionine, and [ 32 P]-yATP, thosefrom Amersham, and 

15 as for Taq DNA polymerase, those from Kurabo and 
Grainer were used. As for the other biochemical rea- 
gents, those from Sigma and Wako Pure Chemicals 
were used. A plasmid containing the N-terminal half re- 
gion of human tau protein (amino acid residue numbers 

20 1-165) (pAR3040) was prepared by picking up the full 
length gene of human tau protein by PCR method from 
a cDNA library of human brain cloned in X ZAPI I, intro- 
ducing the gene into a plasmid, amplifying only the N- 
terminal half region by PCR, and introducing the ampli- 

25 fied product into a plasmid. As the PCR (polymerase 
chain reaction) apparatus, Model ASTEC PC800 
(Astec) was used. 



[01 1 2] mRN As encoding the N-terminal half region of 
human tau protein (amino acid residues 1-165) with or 
35 without stop codon, which were ligated with spacer, pep- 
tide acceptor, and rCpPur at its 3'-terminal end, were 
constructed as follows (Figure 8). 

1) A plasmid into which the N-terminal half region 
40 of human tau protein (Goedert, M. (1 989) EMBO J. 

8, 392-399) (pAR3040) was introduced, was linear- 
ized by digestion with a restriction enzyme BglW. 

2) The N-terminal half region (amino acid residue 
numbers 1-165) was amplified from the above ge- 

45 nome by PCR. As primers, used were Leftl (SEQ 
ID NO: 18) for the 5' side, and Rightl (SEQ ID NO: 
19) with stop codon, or Right2 (SEQ ID NO: 20) 
without stop codon for the 3' side. The PCR condi- 
tions consisted of 92°C/30 seconds for denatura- 
50 tion, 65°C/30 seconds for annealing, and 73°C/1 
minute for elongation, and this cycle was repeated 
30 times. 

3) A DNA sequence composed of the promoter re- 
gion ofT7 RNA polymerase, Kozak sequence, and 

55 DNA sequence corresponding to amino acid resi- 
due numbers 1 -25 of human tau protein connected 
in this order (SEQ ID NO: 21) was prepared by 
chemical synthesis. 



(1 ) Preparation of genome 

30 

A. Preparation of DNA for N-terminal half region 



45 
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4) Thetwo kinds of purified DNAobtained in the pro- 
cedures of the above 2) and 3) were connected by 
a two-step PCR as follows. That is, a mixture of the 
aforementioned two kinds of DNA was first ampli- 
fied in the absence of primer, and subsequently am- 
plified in the presence of primers of Left2 (SEQ ID 
NO: 22) and Rightl (SEQ ID NO: 19) or Right2 
(SEQ ID NO: 20). From the above procedure, a 
DNA containing the promoter of T7 RNA polymer- 
ase and Kozak sequence at an upstream side of 
ORF of the N-terminal half region of human tau pro- 
tein was prepared. An RNA was obtained through 
a reaction using this DNA as template, and T7 RNA 
polymerase at 37°C for two hours. 

B. Ligation of Spacer and peptide acceptor 

[0113] Spacer 5 (SEQ ID NO: 23) and a peptide ac- 
ceptor (P-Acceptor, SEQ ID NO: 15), which was a chi- 
meric nucleic acid composed of 21 -nucleotide DNA and 
4-nucleotide RNA, i.e., 25 nucleotides in total, was 
chemically synthesized. The 5 -terminal end of the pep- 
tide acceptor was phosphorylated by reaction at 36°C 
for 1 hour using T4 polynucleotide kinase, and the pep- 
tide acceptor was backed with a splint DNA (SEQ ID 
NO: 24) having asequence complementary thereto, and 
ligated to the 3'-terminal end of Spacer 5 through a re- 
action at 1 6°C for 2 hours using T4 DNA ligase. 

C. Ligation of RNA and Spacer-peptide acceptor 

[0114] The ligation product of Spacer 5-peptide ac- 
ceptor obtained in the above B was phosphorylated at 
the 5 -terminal end through a reaction at 36°C for 1 hour 
using T4 polynucleotide kinase, and ligated to the RNA 
obtained in the above A through a reaction at4°Cfor48 
hours using T4 RNA ligase. 

D. Ligation of rCpPur 

[01 1 5] The rCpPur obtained in the above <1 > Prepa- 
ration of 3'-terminal end portion of nucleic acid portion 
was phosphorylated through a reaction at 15°C for 24 
hours using T4 polynucleotide kinase, and ligated to the 
3'-terminal end of the genome prepared in the above C 
through a reaction at37°Cfor30 minutes using T4 RNA 
ligase. From this procedure, a chimera RNA genome 
having puromycin at its 3'-terminal end could be con- 
structed. 

E. Bonding of rCpPurto C-terminal end of N-terminal 
half of human tau protein 

[01 1 6] It is considered that, in obtaining effective bind- 
ing of a C-terminal end of a protein and an RNA encod- 
ing it, the distance between puromycin and a stop co- 
don, and presence or absence of a stop codon would 
become important factors. Therefore, in order to exam- 



ine effects of these factors, the following three kinds of 
genomes, mRNAs encoding the N-terminal half of hu- 
man tau protein each (1) having a stop codon but not 
having a DNA spacer, (2) having neither of a stop codon 

5 and a DNA spacer, and (3) not having a stop codon but 
having a DNA spacer, at the 3'-terminal end were pre- 
pared. By using these three kinds of genomes, protein 
synthesis was performed in the presence of rCpPur la- 
beled with 32 P in a cell-free translation system utilizing 

10 rabbit reticulocyte lysate (Figure 9). It was found that, 
when the 3'-terminal end did not have a DNA spacer 
rCpPur was bound to the C-terminal of the protein with 
a similar efficiency regardless of the presence or ab- 
sence of a stop codon. That is. in SDS-PAGE (SDS- 

15 polyacrylamide gel electrophoresis), the bands of the 
proteins bonded to rCpPur (the first and the second 
lanes from the left in Figure 9) appeared at the same 
location as that of the protein monomer labeled with 
[ 35 S]-methionine (the most right lane in Figure 9). On 

20 the other hand, if mRNA had a DNA spacer, rCpPur was 
bonded to the C-terminal end of the protein at an effi- 
ciency about three times higher than those obtained in 
the formertwo kinds of mRNA even without a stop codon 
(the third lane from the left, Figure 9). This result can be 

25 considered to indicate that translation pausing of ribos- 
ome occurred on the DNA sequence, and as a result, 
rCpPur and the protein could be bonded efficiently. Fur- 
ther, this result suggests that, when a genome without 
a stop codon having a DNA spacer and rCpPur at its 3'- 

30 terminal end is used as mRNA in a cell-free translation 
system, puromycin at the 3'-terminal end of mRNA can 
efficiently be bonded to the C-terminal end of the corre- 
sponding translated protein. 

35 <3> Construction of in vitro virus in cell-free translation 
system 

[01 1 7] The genome constructed in the above <2> Nu- 
cleic acid portion (preparation of in vitro virus genome) 

40 composed of m RNA encoding the N-terminal half of hu- 
man tau protein (1-165), DNA spacer (1 05 mer), peptide 
acceptor and rCpPur was translated by using rabbit re- 
ticulocyte lysate. When incorporation of [ 35 S]-methio- 
nine into the protein was examined first by using an RNA 

45 encoding the N-terminal half (1 -1 65) of human tau pro- 
tein as mRNA, bands appeared at locations correspond- 
ing to monomer (about 28 KDa) and dimer (about 55 
KDa) of the N-terminal half (1-165). In this case, the 
monomer was the major product, and the dimer was ob- 

50 served in an extremely small amount (the first left lane 
in Figure 10(A)). This result indicates that the RNA en- 
coding the N-terminal half of human tau protein (1 -1 65) 
functioned as mRNA. When the genome composed of 
mRNA encoding the N-terminal half (1-165) of human 

55 tau protein, DNA spacer (105 mer), peptide acceptor 
and rCpPurwas translated in a similar cell-free transla- 
tion system containing [ 35 S]-methionine, and the prod- 
ucts were analyzed in time course (5 minutes, 10 min- 
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utes, 20 minutes, and 40 minutes), a new wide band ap- 
peared at a location slightly above that of the genome 
in addition to the bands at the location of the monomer 
and dimer (the first right lane in Figure 10(A)). The in- 
tensity of this band increased with increase of reaction 
time (the second to fifth lanes from the left in Figure 1 0 
(A)), and increase of the genome amount (Lanes 3 and 
4 in Figure 10(B)). These results indicate that the ge- 
nome was bonded to the C-terminal end of the protein 
with a covalent bond through puromycin. This also 
means that a genotype was covalently bound to a phe- 
notype. That is, a molecule that assigns a genotype to 
a phenotypes was formed. The present inventors des- 
ignated this assigning molecule as in vitro virus. When 
the effect of the length of DNA spacer on the formation 
of in vitro virus was examined, it was found that the in 
vitro virus was not formed efficiently with a length of 
about 80 mer, and it required a length of at least 1 00 mer. 
[0118] Further, the generation of in vitro virus was 
confirmed by using rCpPur labeled with 32 P. That is, a 
genome composed of mRNA encoding the N-terminal 
half (1-165) of human tau protein, DNA spacer (105 
mer), peptide acceptor, and [ 32 P]-rCp Pur was translated 
by using rabbit reticulocyte lysate. The bonding of the 
genome and the protein was confirmed by digestion with 
mung bean nuclease. That is, when thetranslation prod- 
uct (Lane 3 in Figure 11) was digested with mung bean 
nuclease, bands appeared at the locations correspond- 
ing to monomer and dimer (Lane 1 in Figure 11) of the 
N-terminal half (1-165) of human tau protein (Lane 4 in 
Figure 11). This indicates that the rCpPur labeled 
with 32 P was attached to the 3'-terminal end of the pro- 
tein. Also from this result, it was confirmed that the ge- 
nome was bound to the C-terminal end of the protein 
with a covalent bond through puromycin. The efficiency 
of the binding was estimated to be about 10%. Because 
in vitro virus genome having a concentration of 40 to 
1 00 pmol/ml can be prepared, generated in vitro viruses 
would consist of a population containing 2.4 to 6 x 1 0 12 
of mutants, and this number corresponds to 10000times 
of that obtained in the phage display method (Scott, J. 
K. & Smith, G. P. (1990) Science 249, 386-390). The 
genotype assignment to phenotype has advantages, for 
example, it eliminates the problem concerning the per- 
meability, and it enables incorporation of various non- 
naturally-occurring amino acids, and therefore it ena- 
bles synthesis of an extremely large number of mutants, 
or creation of various functional proteins. 

Example3: Protein evolution simulation method utilizing 
in vitro viruses 

[0119] The protein evolution simulation method utiliz- 
ing in vitro viruses comprises, as shown in Figure 12, 
(1 ) construction of in vitro virus genomes, (2) completion 
of in vitro viruses, (3) selection process, (4) introduction 
of mutation, and (5) amplification, and it allows modifi- 
cation and creation of functional proteins. In particular, 



repetition of these steps allows efficient modification 
and creation of functional proteins. Among these steps, 
the steps of (1) and (2) were specifically explained in 
Examples 1 and 2 mentioned above. Therefore, the 

5 steps (3), (4) and (5) will be explained in this example. 
[0120] First, it was examined whether peptides spe- 
cific to an antibody were selected. Specifically, mouse 
IgG was used as the antibody, and the known ZZ region 
of protein A (Nilsson, B., et al., (1 987) Protein Eng., 1 . 

10 1 07-1 1 3) was used as a peptide sequence to be specif- 
ically bound to the antibody. As a control, N-terminal re- 
gion (1-105) of human tau protein (Goedert, M. (1989) 
EMBO J. 8, 392-399) was used. According to the con- 
struction methods of in vitro viruses described in the 

15 above Examples 1 and 2, in vitro virus genomes encod- 
ing the ZZ region of protein A and the N-terminal region 
(1-105) of human tau protein were prepared. With dif- 
ferent ratios of the in vitro virus genome encoding the 
ZZ region of protein A and the in vitro virus genome en- 

20 coding the N-terminal region (1 -1 05) of human tau pro- 
tein varying as 1:1, 1:10, 1 :100, 1 :1000 or the like, they 
were translated in a cell-free translation system utilizing 
rabbit reticulocyte lysate at 30° C for 10 minutes. Then, 
the translation product was diluted, and centrifuged to 

25 remove insoluble fractions, and the supernatant was 
added to a microplate coated with mouse IgG (blocked 
with bovine serum albumin beforehand), and left stand 
at 4°C for 2 hours. Then, the translation product was 
removed from the microplate, and it was washed with a 

30 washing buffer (50 mM Tris/acetic acid, pH 7.5, 1 50 mM 
NaCI, 10 mM EDTA, 0.1%Tween 20) for 6 times in total, 
and eluted 2 times with an elution buffer (1 M acetic acid, 
pH 2.8). The eluted solution was subjected to ethanol 
precipitation, and the precipitates were dissolved in ster- 

35 ile water (20 jxl), and used as a template for reverse tran- 
scription PCR. The reverse transcription PCR was per- 
formed by using reverse transcriptase (Avian Mieloblas- 
tosis Virus Reverse Transcriptase, Promega), DNA 
polymerase (Tfl DNA Polymerase, Promega), and RT+ 

40 (SEQ ID NO: 25) and RT- (SEQ ID NO: 26) as primers. 
Following a reaction at 48°Cfor40 minutes, the reverse 
transcriptase was inactivated with a treatment at 94° C 
for 5 minutes, and a cycle of 94°C for 30 seconds, 66°C 
for 40 seconds, and 72°C for 40 seconds was repeated 

45 30 times. The obtained PCR product was electro- 
phoresed at 55°C on 4% polyacrylamide gel containing 
8 M urea, and observed by silver staining. As a result, 
it was found that the in vitro virus genome containing the 
ZZ region of protein A could be amplified even in an 

50 amount of one-1 00th of the control genome, i.e., the in 
vitro virus genome containing the N-terminal region 
(1 -105) of human tau protein. This result indicates that 
the in vitro virus genome containing the ZZ region of pro- 
tein A was specifically bound to mouse IgG through the 

55 translated ZZ region of protein A. Therefore, it was con- 
firmed that the in vitro viruses could be selected. Intro- 
duction of mutation and amplification can be performed 
by using the already-established error-prone PCR (Le- 
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ung, D. W., et al., (1989) J. Methods Cell Mol. Biol., 1, 
11-15), Sexual PCR (Stemmer, W. P. C. (1994) Proc. 
Natl. Acad. Sci. If USA 91, 10747-10751) or the like. 
Therefore, it was verified that the protein evolution sim- 
ulation method shown in Figure 12 was feasible. 5 

Industrial Applicability 

[0121] According to the present invention, a molecule 
assigning a genotype (nucleic acid portion) to a pheno- 10 
type (protein portion), and construction methods there- 
for are provided. There are also provided a protein ev- 
olution simulation method utilizing molecules that as- 
sign a genotype to a phenotype (in vitro viruses) con- 
structed according to the present invention , which com- is 
prises selecting the in vitro viruses by the in vitro selec- 
tion method, amplifying the gene portion of an extremely 
small amount of the selected in vitro viruses by reverse 
transcription PCR ; and further performing amplification 
while introducing a mutation, and the like. The molecule 20 
assigning the genotype to the phenotype, the protein ev- 
olution simulation method utilizing it and the like of the 
present invention are an extremely useful substance or 
experimental system for evolutionary molecular engi- 
neering, i.e., modification of functional biopolymers 25 
such as enzymes, antibodies, and ribozymes. and cre- 
ation of biopolymers having functions which cannot be 
found in living organisms. 

30 

Claims 

1. A molecule assigning a genotype to a phenotype, 
which comprises a nucleic acid portion having a nu- 
cleotide sequence reflecting the genotype, and a 35 
protein portion comprising a protein involved in ex- 
hibition of the phenotype, the nucleic acid portion 
and the protein portion being directly bound by a 
chemical bond through a substance having a chem- 
ical structure of a member selected from the group 40 
consisting of puromycin, 3'-N-aminoacylpuromycin 
aminonucleoside : and 3'-N-aminoacyladenosine 
aminonucleoside : and said protein portion being 
encoded by said nucleic acid portion. 

45 

2. The assigning molecule according to claim 1 , 
wherein a 3'-terminal end of the nucleic acid portion 
and a C-terminal end of the protein portion are 
bonded with a covalent bond. 

50 

3. The assigning molecule according to any one of 
claim 1 or 2, wherein a 3'-terminal end of the nucleic 
acid portion covalently bonded to a C-terminal end 
of the protein portion is puromycin. 

55 

4. The assigning molecule according to claim 1 , 
wherein the nucleic acid portion comprises a gene 
composed of RNA, and a suppressortRNA bonded 



to the gene through a spacer. 

5. The assigning molecule according to claim A, 
wherein the suppressor tRNA comprises an antico- 
don corresponding to a termination codon of the 
gene. 

6. The assigning molecule according to claim 1 . 
wherein the nucleic acid portion comprises a gene 
composed of RNA, and a spacer composed of DNA 
and RNA. 

7. The assigning molecule according to claim 1 . 
wherein the nucleic acid portion comprises a gene 
composed of RNA, and a spacer composed of DNA 
and polyethylene glycol. 

8. The assigning molecule according to claim 1 . 
wherein the nucleic acid portion comprises a gene 
composed of RNA. and a spacer composed of a 
double-stranded DNA. 

9. The assigning molecule according to claim 1 . 
wherein the nucleic acid portion comprises a gene 
composed of RNA, and a spacer composed of a 
double strand of RNA and a short chain peptide nu- 
cleic acid (PNA) or DNA. 

10. The assigning molecule according to claim 1. 
wherein the nucleic acid portion comprises a gene 
composed of DNA, and a spacer composed of DNA 
and RNA. 

11. A method for constructing the assigning molecule 
as defined in claim 4, which comprises (a) bonding 
a DNA comprising a sequence corresponding to a 
suppressortRNA, to a3'-terminal end of a DNAcon- 
taining a genethrough a spacer, (b) transcribing the 
obtained DNA bonded product into RNA, (c) bond- 
ing : to a 3'-terminal end of the obtained RNA, a nu- 
cleoside or a substance having a chemical structure 
analogoustothat of a nucleoside, which can be cov- 
alently bound to an amino acid or a substance hav- 
ing a chemical structure analogous to that of an ami- 
no acid, and (d) performing protein synthesis in a 
cell-free protein synthesis system using the ob- 
tained bonded product as mRNA to bond a nucleic 
acid portion containing the gene to a translation 
product of the gene. 

12. A method for constructing the assigning molecule 
as defined in claim 6 : which comprises (a) preparing 
a DNA containing a gene which has no termination 
codon, (b) transcribing the prepared DNA into RNA, 

(c) bonding a chimeric spacer composed of DNA 
and RNA to a 3'-terminal end of the obtained RNA, 

(d) bonding, to a 3'-terminal end of the obtained 
bonded product, a nucleoside, or a substance hav- 
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ing a chemical structure analogous to that of a nu- 
cleoside, which can be covalently bound to an ami- 
no acid or a substance having a chemical structure 
analogous to that of an amino acid, and (e) perform- 
ing protein synthesis in a cell-free protein synthesis 5 
system using the obtained bonded product as mR- 
NA to bond a nucleic acid portion containing the 
gene to a translation product of the gene. 

13. The construction method according to claim 11 or 10 
1 2, wherein the nucleoside orthe substance having 

the chemical structure analogous to that of the nu- 
cleoside is puromycin. 

14. A method for constructing the assigning molecule 15 
as defined in claim 7, which comprises (a) preparing 

a DNA containing a gene which has no termination 
codon, (b) transcribing the prepared DNA into RNA, 
(c) bonding a chimeric spacer composed of DNA 
and polyethylene glycol to a 3'-terminal end of the 20 
obtained RNA, (d) bonding, to a 3'-terminal end of 
the obtained bonded product, a nucleoside or a sub- 
stance having a chemical structure analogous to 
that of a nucleoside, which can be covalently bound 
to an amino acid or a substance having a chemical 25 
structure analogous to that of an amino acid, and 
(e) performing protein synthesis in a cell-free pro- 
tein synthesis system using the obtained bonded 
product as mRNA to bond a nucleic acid portion 
containing the gene to a translation product of the 30 
gene. 

15. A method for constructing the assigning molecule 
as defined in claim 8, which comprises (a) preparing 

a DNA containing a gene which has no termination 35 
codon, (b) transcribing the prepared DNA into RNA, 
(c) bonding a spacer composed of double-stranded 
DNA to a 3'-terminal end of the obtained RNA, (d) 
bonding, to a3'-terminal end of the obtained bonded 
product, a nucleoside or a substance having a 40 
chemical structure analogous to that of a nucleo- 
side, which can be covalently bound to an amino 
acid or a substance having a chemical structure 
analogous to that of an amino acid, and (e) perform- 
ing protein synthesis in a cell-free protein synthesis 45 
system using the obtained bonded product as mR- 
NA to bond a nucleic acid portion containing the 
gene to a translation product of the gene. 

16. A method for constructing the assigning molecule 50 
as defined in claim 9, which comprises (a) preparing 

a DNA containing a gene which has no termination 
codon, and a nucleotide sequence of a spacer, (b) 
transcribing the prepared DNA into RNA, (c) bond- 
ing, to a 3'-terminal end of the obtained RNA, a nu- 55 
cleoside or a substance having a chemical structure 
analogous to that of anucleoside ; which can be cov- 
alently bound to an amino acid or a substance hav- 



ing a chemical structure analogous to that of an ami- 
no acid, (d) adding a short chain PNA or DNA to a 
3'-terminal end side portion of the gene in the ob- 
tained RNA bonded product to form a double- 
stranded chain, and (e) performing protein synthe- 
sis in a cell-free protein synthesis system using the 
obtained bonded product as mRNA to bond a nu- 
cleic acid portion containing the gene to a transla- 
tion product of the gene. 

17. A method for protein evolution simulation, which 
comprises a construction step for constructing as- 
signing molecules from a DNA containing a gene 
by the construction method as defined in any one 
of claims 11 . 12, 14, 15 and 16. a selection step for 
selecting the assigning molecules obtained in the 
construction step, a mutation introduction step for 
introducing a mutation into a gene portion of an as- 
signing molecule selected in the selection step, and 
an amplification step for amplifying the gene portion 
obtained in the mutation introduction step. 

1 8. The method for protein evolution simulation accord- 
ing to claim 1 7, wherein the construction step, the 
selection step, the mutation introduction step and 
the amplification step are repeatedly performed by 
providingthe DNA obtained in the amplification step 
to the construction step. 

1 9. A method for assaying protein/protein or protein/nu- 
cleic acid intermolecular action, which comprises a 
construction step for constructing assigning mole- 
cules by the construction method as defined in any 
one of claims 11, 12, 14, 15 and 16, and an assay 
step for examining intermolecular action of the as- 
signing molecules obtained in the construction step 
with another protein or nucleic acid. 



Patentanspruche 

1. Molekul, das einen Genotyp einem Phanotyp zu- 
ordnet, welches einen Nukleinsaureteil umfaBt, der 
eine Nukleotidsequenz hat, welche den Genotyp 
wiedergibt, und einen Proteinteil, der ein Protein 
umfaBt, das beim Aufzeigen des Phanotyps betei- 
ligt ist, wobei der Nukleinsaureteil und der Protein- 
teil durch eine Substanz direkt durch eine chemi- 
sche Bindung mit einer chemischen Struktur ver- 
bunden ist, die von einem Mitglied stammt, das aus 
der Gruppe ausgewahlt ist, welche aus Puromycin, 
3'-N-Aminoacylpuromycinaminonukleosid und 
3'-N-Aminoacyladenosinaminonukleosid besteht, 
und dieser Proteinteil durch diesen Nukleinsaureteil 
kodiert wird. 

2. Das Zuordnungsmolekul gemaB Anspruch 1 , wobei 
ein 3'-terminales Ende des Nukleinsaureteils und 
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ein C-terminales Ende des Proteinteils mit einer ko- 
valenten Bindung verbunden sind. 

3. Das Zuordnungsmolekul gemaB jedem der Ansprii- 
che 1 oder 2, wobei ein 3'-ternninales Ende des Nu- 5 
kleinsaureteils kovalent an das C-terminale Ende 
des Proteinsteils gebunden ist. Puromycin ist. 

4. Das Zuordnungsmolekul gemaB Anspruch 1 , wobei 

der Nukleinsaureteil ein Gen umfaBt, das aus einer 10 
RNA und einer Suppressor-tRNA zusammenge- 
setzt ist, das durch einen Abstandshalter an das 
Gen gebunden ist. 

5. Das Zuordnungsmolekul gemaB Anspruch 4, wobei 15 
die Suppressor-tRNA einen Antikodon umfaBt, der 
dem Terminationskodon des Gens entspricht. 

6. Das Zuordnungsmolekul gemaB Anspruch 1 , wobei 

der Nukleinsaureteil ein Gen umfaBt, das aus einer 20 
RNA und einem Abstandhalter zusammengesetzt 
ist, der aus DNA und RNA besteht. 

7. Das Zuordnungsmolekul gemaB Anspruch 1 , wobei 
der Nukleinsaureteil ein Gen, das aus RNA besteht, 25 
und einen Abstandshalter, der aus DNA und Polye- 
thylenglycol zusammengesetzt ist, umfaBt. 

8. Das Zuordnungsmolekul gemaB Anspruch 1 , wobei 
der Nukleinsaureteil ein Gen, das aus RNA besteht, 30 
und einen Abstandshalter, der aus einer doppelst- 
rangigen DNA besteht, umfaBt. 

9. Das Zuordnungsmolekul gemaB Anspruch 1 , wobei 

der Nukleinsaureteil ein Gen, das aus RNA besteht, 35 
und einen Abstandshalter. der aus einem RN A-Dop- 
pelstrang und einer kurzkettigen Peptidnukleinsau- 
re (PNA) oder DNA zusammengesetzt ist, umfaBt. 

10. Das Zuordnungsmolekul gemaB Anspruch 1, wobei 40 
der Nukleinsaureteil ein Gen, das aus DNA besteht, 
und einen Abstandshalter, der aus DNA und RNA 
zusammengesetzt ist, umfaBt. 

11. Verfahrenzum Herstellen desZuordnungsmolekuls 45 
wie in Anspruch 4 definiert, welches umfaBt (a) Bin- 
den einer DNA, die eine Sequenz umfaBt, welche 
einer Suppressor-tRNA entspricht, mit Hilfe eines 
Abstandshalters an das 3'-terminale Ende einer 
DNA, die ein Gen enthalt, (b) Transkribieren des er- 50 
haltenen DNA-gebundenen Produkts in RNA ; (c) 
Binden eines Nukleosids oder einer Substanz mit 
einer chemischen Struktur, die der eines Nukleo- 
sids analog ist, welche kovalent an eine Aminosau- 

re oder eine Substanz mit einer chemischen Struk- 55 
tur gebunden werden kann, die der einer Aminosau- 
re analog ist, an das 3'-terminale Ende der erhalte- 
nen RNA, und (d) Durchfuhren der Proteinsynthese 



in einem zellfreien Proteinsynthesesystem unter 
Verwendung des erhaltenen gebundenen Produkts 
als mRNA, urn einen Nukleinsaureteil, der das Gen 
enthalt, an ein Translationsprodukt des Gens zu 
binden. 

12. Verfahren zum Konstruieren des Zuordnungsmole- 
kuls wie in Anspruch 6 definiert, welches umfaBt (a) 
Herstellen einer DNA, die ein Gen enthalt, welches 
keinen Terminationskodon hat, (b) Transkribieren 
der hergestellten DNA in RNA, (c) Binden eines chi- 
maren Abstandshalters, der aus DNA und RNA zu- 
sammengesetzt ist, an das 3'-terminale Ende der 
erhaltenen RNA, (d) Binden eines Nukleosids oder 
einer Substanz mit einer chemischen Struktur, die 
der eines Nukleosids analog ist, welche kovalent an 
eine Aminosaure oder an eine Substanz mit einer 
chemischen Struktur gebunden werden kann, die 
der einer Aminosaure analog ist, an das 3'-termina- 
le Ende des erhaltenen gebundenen Produkts, und 
(e) Durchfuhren der Proteinsynthese in einem zell- 
freien Proteinsynthesesystem unter Verwendung 
des erhaltenen gebundenen Produkts als mRNA, 
urn einen Nukleinsaureteil, der das Gen enthalt, an 
ein Translationsprodukt des Gens zu binden. 

13. Konstruktionsverfahren gemaB Anspruch 11 oder 
1 2, wobei das Nukleosid oder die Substanz mit der 
chemischen Struktur die der des Nukleosids ana- 
log ist, Puromycin ist. 

14. Verfahren zum Konstruieren des Zuordnungsmole- 
kuls wie in Anspruch 7 definiert, welches umfaBt (a) 
Herstellen einer DNA, die ein Gen enthalt, welche 
keinen Terminationskodon hat, (b) Transkribieren 
der hergestellten DNA in RNA, (c) Binden eines chi- 
maren Abstandshalters, der aus DNA und Polyethy- 
lenglycol besteht, an ein 3'-terminales Ende der er- 
haltenen RNA, (d) Binden eines Nukleosids oder ei- 
ner Substanz mit der chemischen Struktur, die der 
eines Nukleosids analog ist, welche kovalent an ei- 
ne Aminosaure oder eine Substanz mit einer che- 
mischen Struktur gebunden werden kann, die der 
einer Aminosaure analog ist, an das 3'-terminale 
Ende des erhaltenen gebundenen Produkts, und 
(e) Durchfuhren der Proteinsynthese in einem zell- 
freien Proteinsynthesesystem unter Verwendung 
des erhaltenen gebundenen Produkts als mRNA, 
urn einen Nukleinsaureteil, der das Gen enthalt, an 
ein Translationsprodukt des Gens zu binden. 

15. Verfahren zum Konstruieren des Zuordnungsmole- 
kuls wie in Anspruch 8 definiert, welches umfaBt (a) 
Herstellen einer DNA, die ein Gen enthalt, welches 
keinen Terminationskodon hat, (b) Transkribieren 
der herstellten DNA in RNA, (c) Binden eines Ab- 
standshalters, der aus doppelstrangiger DNA be- 
steht, an ein 3'-terminales Ende der erhaltenen 



35 



21 



41 



EP 0 962 527 B1 



42 



RNA, (d) Binden eines Nukleosids oder eine Sub- 
stanz mit einer chemischen Struktur, die der eines 
Nukleosids analog ist, welche kovalent an eine Ami- 
nosaure oder an eine Substanz mit einer chemi- 
schen Struktur gebunden werden kann, die der ei- 
ner Aminosaure analog ist, an das 3'-terminale En- 
de des erhaltenen gebundenen Produkts, und (e) 
Durchfuhren einer Proteinsynthese in einem zell- 
freien Proteinsynthesesystem unter Verwendung 
des erhaltenen gebundenen Produkts als mRNA, 
urn einen Nukleinsaureteil, der das Gen enthalt, an 
ein Translationsprodukt des Gens zu binden. 

16. Verfahren zum Konstruieren des Zuordnungsmole- 
kuls wie in Anspruch 9 definiert, welches umfaGt (a) 
Herstellen einer DNA. die ein Gen enthalt, welche 
keinen Terminationskodon und eine Nukleotidse- 
quenz eines Abstandshalters hat, (b) Transkribie- 
ren der hergestellten DNA in RNA, (c) Binden eines 
Nukleosids oder einer Substanz mit einer chemi- 
schen Struktur, die der eines Nukleosids analog ist, 
welche kovalent an eine Aminosaure oder eine 
Substanz mit einer chemischen Struktur gebunden 
werden kann, die der einer Aminosaure analog ist, 
an das 3'-terminale Ende der erhaltenen RNA, (d) 
Hinzugeben einer kurzkettigen PNA oder einer 
DNA an den 3'-terminalen Teil des Seitenendes des 
Gens in dem erhaltenen RNA-gebundenen Pro- 
dukt, urn eine doppelstrangige Kette zu bilden, und 
(e) Durchfuhren einer Proteinsynthese in einem 
zellfreien Proteinsynthesesystem unter Verwen- 
dung des erhaltenen gebundenen Produkts als mR- 
NA, urn einen Nukleinsaureteil, der das Gen ent- 
halt, an ein Translationsprodukt des Gens zu bin- 
den. 

17. Verfahren zur Proteinevolutionssimulierung, wel- 
ches einen Konstruktionsschritt zum Konstruieren 
von Zuordnungsmolekulen von DNA, die ein Gen 
enthalt, durch ein Konstruktionsverfahren, das in ir- 
gendeinem der Anspruche 1 1 , 1 2, 14, 15 und 1 6 de- 
finiert ist, einen Selektionsschritt zum Selektionie- 
ren der Zuordnungsmolekule, die in dem Konstruk- 
tionsschritt erhalten wurden, einen Mutations-ein- 
fuhrenden Schritt zum Einfugen einer Mutation in 
einen Teil eines Gens eines Zuordnungsmolekuls, 
welches im Selektionsschritt ausgewahlt wurde, 
und einen Amplifikationsschritt zum Amplifizieren 
des Genteils. welcher in dem Mutations-einfuhren- 
den Schritt erhalten wurde, umfaBt, 

18. Verfahren zur Proteinevolutionssimulierung gemaR 
Anspruch 17, wobei der Konstruktionsschritt, der 
Selektionsschritt, der Mutations-einfuhrende 
Schritt und der Amplifikationsschritt wiederholt 
durchgefuhrt werden, indem die DNA, die in dem 
Amplifikationsschritt erhalten wurde, im Konstrukti- 
onsschritt bereitgestellt wird. 



19. Verfahren zum Untersuchen von intermolekularen 
Protein/Protein- oder Protein/Nukleinsaure-Wir- 
kungen, welches einen Konstruktionsschritt zum 
Konstruieren von Zuordnungsmolekulen mit Hilfe 

5 des Konstruktionsverfahrens, das in irgendeinem 
der Anspruche 11, 12, 14. 1 5 und 16 definiert ist, 
und einen Untersuchungsschritt zum Untersuchen 
der intermolekularen Wirkung der in dem Konstruk- 
tionsschritt erhaltenen Zuordnungsmolekule mit ei- 

10 nem anderen Protein oder einer Nukleinsaure um- 
faGt. 



Revendications 

15 

1. Molecule assignant un genotype a un phenotype, 
qui comprend une partie d'acide nucleique ayant 
une sequence de nucleotides refletant le genotype, 
et une partie de proteine comprenant une proteine 

20 impliques dans la presentation du phenotype, la 
partie d'acide nucleique et la partie de proteine 
etant directement liees par une liaison chimique par 
I'intermediaire d'une substance ayant une structure 
chimique d'un membre choisi dans le groupe cons- 

25 titue d'une puromycine, d'un 3'-N-aminoacylpuro- 
mycine aminonucleoside, et d'un 3'-N-aminoacyla- 
denosine aminonucleoside. et ladite partie de pro- 
teine etant codee par ladite partie d'acide nuclei- 
que. 

30 

2. Molecule d'assignation selon la revendication 1 . 
dans laquelle une extremite 3'-terminale de la partie 
d'acide nucleique et une extremite C-terminale de 
la partie de proteine sont liees avec une liaison co- 

35 valente. 

3. Molecule d'assignation selon I'une quelconque des 
revendications 1 ou 2. dans laquelle une extremite 
3'-terminale de la partie d'acide nucleique liee de 

40 maniere covalente a une extremite C-terminal de la 
partie de proteine est une puromycine. 

4. Molecule d'assignation selon la revendication 1 . 
dans laquelle la partie d'acide nucleique comprend 

45 un gene compose d'ARN. et un ARNt suppresseur 
lie au gene a travers un espaceur. 

5. Molecule d'assignation selon la revendication 4, 
dans laquelle I'ARNt suppresseur comprend un an- 

50 ticodon correspondant a un codon de terminaison 
du gene. 

6. Molecule d'assignation selon la revendication 1 . 
dans laquelle la partie d'acide nucleique comprend 

55 un gene compose d'ARN, et un espaceur compose 
d'ADN et d'ARN. 

7. Molecule d'assignation selon la revendication 1 . 
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dans laquelle la partie d'acide nucleique comprend 
un gene compose d'ARN, et un espaceur compose 
d'ADN et de polyethylene glycol. 

8. Molecule designation selon la revendication 1,. 5 
dans laquelle la partie d'acide nucleique comprend 

un gene compose d'ARN, et un espaceur compose 
d'un ADN a double brin. 

9. Molecule designation selon la revendication 1, 10 
dans laquelle la partie d'acide nucleique comprend 

un gene compose d'ARN, et un espaceur compose 
d'un ARN a double brin et d'un acide nucleique pep- 
tidique (PNA) a chame courte ou d'ADN. 

15 

10. Molecule d'assignation selon la revendication 1, 
dans laquelle la partie d'acide nucleique comprend 
un gene compose d'ADN, et un espaceur compose 
d'ADN et d'ARN. 

20 

11. Procede pour construire la molecule d'assignation 
comme definie dans la revendication 4, qui com- 
prend (a) de lier un ADN comprenant une sequence 
correspondant a un ARNt suppresseur, a une ex- 
tremite 3'-terminale d'un ADN contenant un gene 25 
par I'intermediaire d'un espaceur. (b) de transcrire 

le produit lie a I'ADN obtenu en ARN, (c) de lier, a 
une extremite 3'-terminale de I'ARN obtenu, un nu- 
cleoside ou une substance ayant une structure chi- 
mique analogue a celle d'un nucleoside, qui peut 30 
etre lie de maniere covalente a un acide amine ou 
a substance ayant une structure chimique analogue 
a celle d'un acide amine, et (d) d'effectuer une syn- 
thase de proteines dans un systeme de synthese 
de proteines exempt de cellules en utilisant le pro- 35 
duit lie obtenu en tant que ARNm afin de lier une 
partie d'acide nucleique contenant le gene a un pro- 
duit de traduction du gene. 

12. Procede pour construire la molecule d'assignation 40 
comme definie dans la revendication 6, qui com- 
prend (a) de preparer un ADN contenant un gene 

qui n'a pas de codon de terminaison, (b) de trans- 
crire I'ADN prepare en ARN, (c) de lier un espaceur 
chimerique compose d'ADN et d'ARN a une extre- 45 
mite 3'-terminale de I'ARN obtenu, (d) de lier, a une 
extremite 3'-terminale du produit lie obtenu, un nu- 
cleoside, ou une substance ayant une structure chi- 
mique analogue a celle d'un nucleoside, qui peut 
etre lie de maniere covalente a un acide amine ou 50 
a substance ayant une structure chimique analogue 
a celle d'un acide amine, et (e) d'effectuer une syn- 
thase de proteines dans un systeme de synthese 
de proteines exempt de cellules en utilisant le pro- 
duit lie obtenu en tant que ARNm afin de lier une 55 
partie d'acide nucleique contenant le gene a un pro- 
duit de traduction du gene. 



13. Procede de construction selon la revendication 11 
ou 12, dans lequel le nucleoside ou la substance 
ayant la structure chimique analogue a celle d'un 
nucleoside est une puromyoine. 

14. Procede pour construire la molecule d'assignation 
comme definie dans la revendication 7, qui com- 
prend (a) de preparer un ADN contenant un gene 
qui n'a pas de codon de terminaison, (b) de trans- 
crire I'ADN prepare en ARN, (c) de lier un espaceur 
chimerique compose d'ADN et de polyethylene gly- 
col a une extremite 3'-terminale de I'ARN obtenu, 
(d) de lier, a une extremite 3'-terminale du produit 
lie obtenu, un nucleoside ou une substance ayant 
une structure chimique analogue a celle d'un nu- 
cleoside, qui peut etre lie de maniere covalente a 
un acide amine ou a substance ayant une structure 
chimique analogue a celle d'un acide amine, et (e) 
d'effectuer une synthese de proteines dans un sys- 
teme de synthese de proteines exempt de cellules 
en utilisant le produit lie obtenu en tant que ARNm 
afin de lier une partie d'acide nucleique contenant 
le gene a un produit de traduction du gene. 

15. Procede pour construire la molecule d'assignation 
comme definie dans la revendication 8, qui com- 
prend (a) de preparer un ADN contenant un gene 
qui n'a pas de codon de terminaison, (b) de trans- 
crire I'ADN prepare en ARN, (c) de lier un espaceur 
compose d'ADN a double brin a une extremite 3'- 
terminale de I'ARN obtenu, (d) de lier, a une extre- 
mite 3'-terminale du produit lie obtenu, un nucleo- 
side ou une substance ayant une structure chimi- 
que analogue a celle d'un nucleoside, qui peut etre 
lie de maniere covalente a un acide amine ou a 
substance ayant une structure chimique analogue 
a celle d'un acide amine, et (e) d'effectuer une syn- 
these de proteines dans un systeme de synthese 
de proteines exempt de cellules en utilisant le pro- 
duit lie obtenu en tant que ARNm afin de lier une 
partie d'acide nucleique contenant le gene a un pro- 
duit de traduction du gene. 

16. Procede pour construire la molecule d'assignation 
comme definie dans la revendication 9, qui com- 
prend (a) de preparer un ADN contenant un gene 
qui n'a pas de codon determinaison, et unesequen- 
ce de nucleotides d'un espaceur, (b) de transcrire 
I'ADN prepare en ARN, (c) de lier, a une extremite 
3'-terminale du produit lie obtenu. un nucleoside, ou 
une substance ayant une structure chimique ana- 
logue a celle d'un nucleoside qui peut etre lie de 
maniere covalente a un acide amine ou a substance 
ayant une structure chimique analogue a celle d'un 
acide amine, (d) d'ajouter un PNA a chaine courte 
ou un ADN a une partie laterale de I'extremite 3'- 
terminale du gene dans le produit lie d'ARN obtenu 
afin de former une chaTne a double brin, et (e) d'ef- 
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fectuer une synthese de proteines dans un systeme 
de synthese de proteines exempt de cellules en uti- 
lisant le produit lie obtenu an tant que ARNm afin 
de lier une partie d'acide nucleique contenant le ge- 
ne a un produit de traduction du gene. 5 

17. Procede pour une simulation devolution de protei- 
ne, qui comprend une etape de construction afin de 
construire des molecules d'assignation a partir 
d'ADN contenant un gene par le procede de cons- 10 
truction comme defini dans I'une quelconque des 
revendications 11 , 12, 14, 15 et 1 6, une etape de 
selection afin de choisir les molecules d'assignation 
obtenues dans I'etape de construction, une etape 
d'introduction de mutation pour introduire une mu- is 
tation dans une partie de gene d'une molecule d'as- 
signation choisie dans I'etape de selection, et une 
etape d'amplification afin d'amplifier la partie de ge- 
ne obtenue dans I'etape d'introduction de mutation. 

20 

18. Procede pour une simulation devolution de protei- 
ne selon la revendication 17, dans lequel I'etape de 
construction, I'etape de selection, I'etape d'intro- 
duction de mutation et I'etape d'amplification sont 
effectuees de maniere repetee en fournissant I'ADN 25 
obtenu dans I'etape d'amplification a I'etape de 
construction. 

19. Procede pour analyser une action intermoleculaire 
proteine/proteine ou proteine/acide nucleique, qui 30 
comprend une etape de construction afin de cons- 
truire des molecules d'assignation par le procede 

de construction comme defini dans I'une quelcon- 
que des revendications 11 , 12, 14, 15 et 16, et une 
etape d'analyse pour examiner Taction intermolecu- 35 
laire des molecules d'assignation obtenues dans 
I'etape de construction avec une autre proteine ou 
acide nucleique. 
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Strategy for assigning genotype to phenotype 




Genotype and phenotype: 

Carried on 

tne same molecule 




Form complex 




Contained in 
single compartment 



Strategies for assignment are logically classified into three patterns. 



FIG. 1 
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FIG. 5 
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FIG. 11 
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